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Words are not enough 


The political inertia that characterizes the world’s response to global warming cannot continue. 
Politicians and policy-makers must follow the climate’s lead — and change. 


climate change. Not surprising given that, in Doha, the United 

Nations was wrapping up the latest round of its annual politi- 
cal negotiations on a global agreement to regulate greenhouse-gas 
emissions. But the words, and the bundle of small practical actions, 
that emerged from the meeting had a familiar ring. 

“There has been, yet again, a very big mismatch between the scale 
and urgency of action required to effectively manage the huge risks 
of climate change, and the political will and ambition that has been 
displayed,” said Nicholas Stern, chair of the Grantham Research Insti- 
tute on Climate Change and the Environment at the London School 
of Economics. 

“This package offers improved continuity from existing carbon 
markets to the new markets of the future. But it still won't inspire 
action at the scale commensurate with the Copenhagen objective of 
limiting warming to 2 °C?’ said Dirk Forrister, president of the Inter- 
national Emissions Trading Association. 

Both statements tell you that the Doha talks followed the recent 
trend: warm political words but little sign of serious action. There 
was some minor progress on secondary issues, just enough to keep 
the show on the road, but little to address the core problem of soaring 
emissions. 

There were some familiar problems as well. Here’s Oleg Shamanov, 
Russian negotiator at Doha, grumbling at the way his objection to 
the final Doha text was overruled: “It has to be clearly stated that this 
is an outrageous violation and absolutely unacceptable conduct of 
business. The way those decisions were adopted extremely seriously 
undermines the legitimacy of the regime.’ And Christiana Figueres, 
executive secretary of the UN’s Framework Convention on Climate 
Change: “What we understand is that what Russia wants and needs 
is actually in those texts so my recommendation to our good Russian 
colleague is to take the time to read these texts.” 

Once again, the climate talks dragged on into the small hours, way 
past their scheduled finish time, and descended into acrimony — 
even with so little of substance on the negotiating table. A late and 
messy end is becoming as much a pre-Christmas tradition as the trees 
erected in the airports that the delegates pass through on their way 
home. “In the 16 years we have been coming to these conferences, 
there has been no global warming at all” That statement came from 
the British climate sceptic Christopher Monckton, who imperson- 
ated a delegate from Myanmar to address the conference floor, and 
did at least offer some light relief. “Can we have your conference 
pass back, please?” was pretty much the UN response. “Oh, and don't 
come back.” 

Bona fide attendees at Doha together produced a plan and timetable 
of sorts towards establishing a new agreement in 2015. In the mean- 
time, a weakened Kyoto Protocol was extended to 2020, minus the 
signatures of Russia, Canada and Japan. Talks on how the rich world 


Te past week saw a number of pronouncements on the subject of 


will finance efforts to cut emissions and adapt to changed weather 
patterns in the poor world ran into the desert sand and will be taken 

up again at the end of 2013 in Warsaw. 
Away from Doha, here is another statement on climate change: aver- 
age global temperature will rise by 0.7-1.5 °C between 1990 and 2030, 
with a best estimate of 1.1 °C. That is derived 


“The global from the very first report of the Intergovern- 
atmosphere, the mental Panel on Climate Change in 1990. In 
planet itself, , has many ways it is the first consensus predic- 
shifted while tion of climate change. We're now more than 
the politics has half way through the period covered by that 


stood still.” prediction and so far, at least, it is bang on. 

In a paper published this month, climate 
researchers David Frame and Daithi Stone analyse that original pre- 
diction — and its success (D. J. Frame and D. A. Stone Nature Clim. 
Change http://doi.org/jx6; 2012). “It seems highly likely that even in 
1990 we understood the climate system well enough to make cred- 
ible statements about how its aggregate properties would change on 
timescales out to a couple of decades,’ they write. 

And, from the same paper, here’s a line on climate that deserves 
to roll around the world. “The scientific community has now been 
working on the climate change topic for a period comparable to the 
prediction and the timescales over which the climate is expected to 
respond.” The global atmosphere, the planet itself, has shifted while 
the politics has stood still. What more is there to say? = 


Life on land 


Evidence for the first land life is controversial, 
but the fossil record has a tendency to surprise. 


hen did life first appear on land? The answer to this 
W ezssion — one of the most fundamental in science — 
rather depends on the values you choose for ‘life’ and ‘and. 
There is certainly evidence for freshwater life — pond life, 
essentially — a billion years ago or so (P. K. Strother et al. Nature 473, 
505-509; 2011). Apart from that, the evidence is indirect and inferred 
from signs of weathering of non-marine rocks and the presence of 
apparent palaeosols — sediments indicative of fossilized soil that, by 
definition, were exposed to the air. Actual fossils that might be signs 
of land life in the Precambrian eon (before 542 million years ago) are 
exceedingly rare — or, some say, mythical. 
It is a highly controversial subject, and one scientist who didnt shy 
away from controversy was the late palaeobotanist Jane Gray. Start- 
ing in the 1950s, Gray argued, often vociferously, for the presence of 
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life on land at an early date. Being female, ferocious and an advocate 
of an unpopular view, she didn’t get many grants. But as biologist 
William Shear wrote in her obituary, she “was as astute at playing 
the stock market as she was at interpreting fossil spores, and used 
her independent wealth to fund her own research” (see Nature 405, 
34; 2000). 

Gregory Retallack of the University of Oregon in Eugene, like 
Jane Gray, is unafraid to plough a contrarian furrow in this much- 
debated area. For many years now, he has worked on palaeosols 
from the Precambrian. The problem with fossil soils is that they 
are conventionally recognized by traces of the organisms that lived 
in them, especially plant roots. There lies a conundrum — how do 
you recognize a palaeosol in sediment that lacks plant roots? The 
answer comes through careful geological work, to show that the 
palaeosol is associated with rock formed under non-marine con- 
ditions, together with work on geochemistry and data from stable 
isotopes. But there might also be direct evidence in the fossil soil, 
in the form of carbonate nodules, sand crystals and cracks caused 
by desiccation or the presence of ice. 

Evidence of this sort has now led Retallack to infer the presence 
of palaeosols among rocks from the Ediacaran period (635 million 
to 542 million years ago) of South Australia. One might think that 
soils from the latest Precambrian are hardly controversial, even by 
the standards of a highly charged field. But these rocks contain fairly 
abundant evidence for the first macroscopic life — which, until now, 
was widely assumed to have been marine. 

Ediacaran rocks feature a wide range of large, distinctive and yet 
enigmatic structures, usually thought to be fossils of living crea- 
tures. Originally found in South Australia, Ediacaran fossils have 
since been discovered in localities as far-flung as Newfoundland in 
Canada, Arctic Russia and the English Midlands. 

The Ediacarans, however, were a rum lot. Although clearly highly 
organized, their precise nature has been elusive. Ifthey were animals, 
they bore little or no resemblance to any other creatures, either fossil or 
extant. This has led to suggestions that they were giant protists, fungi, 


A new iPad app for 
Nature readers 


any publishers hope that tablet renditions of newspapers and 

magazines will revive the fortunes of once-mighty but now 
financially stressed publications founded in print. Nature’s fortunes 
over the years have been relatively positive. Its print circulation 
has declined since its peak in 2002, but not as markedly as those 
of many other publications. Meanwhile, the online readership 
of Nature is more than 3 million unique users every month, and 
growing significantly year on year. 

However, there is a world of difference in user experience 
between Nature on a desktop computer or laptop and a digital 
rendition that captures the feel and easy portability of the weekly 
print issue with the added features that an online platform allows. 
Although Nature has had an iPad version since January 2011, a 
version is now being released that includes the weekly edition in 
its traditionally structured form. Existing personal subscribers to 
Nature have full access to the tablet edition. New subscribers can 
choose to buy only the iPad version at a comparatively low price 
(see go.nature.com/pvfvqy). 

Nine journals from the Nature Publishing Group are included 
in this app: Nature, Nature Biotechnology, Nature Medicine, Nature 
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algae, lichens or even a kind oflife entirely different from anything else 
known and now wholly extinct. 

Perhaps the only point of agreement is that, whatever else they were, 
the Ediacarans lived on the sandy beds of shallow, sunlit seas. This is 
where Retallack parts company with just about everyone else, because 
some of his Ediacaran palaeosols are associated with Ediacaran 
fossils. This would mean that at least some Ediacarans lived on land, 

under the sky, perhaps in the manner of 


“The stately lichens, or microbial colonies that form soil 
progr ession of crusts. The Ediacarans, then, would be the 
life from water now-not-so-rare (and not at all mythical) 
to land was not creatures that first colonized the land — not 
necessarily a just in puddles, but in soils indicative of a 


single, simple 
narrative.” 


dry, cold desert. This is as far away as imagi- 
nable from the oceanic idyll that many have 
assumed for Ediacaran organisms, and have 
reconstructed as such in a million coffee-table books. 

These conclusions are published in a paper on Nature’s website this 
week (G. J. Retallack Nature http://dx.doi.org/10.1038/nature 11777; 
2012) and will cause sharp intakes of breath in the palaeontological 
community; so much so that we have commissioned a News and Views 
Forum to air the arguments (S. Xiao & L. P. Knauth Nature http:// 
dx.doi.org/10.1038/nature 11765; 2012). 

More work — and more science — will be the only way to validate 
this challenging and exciting work. But the lesson from the past 
is clear. Jane Gray’s advocacy of land life in the relatively recent 
Ordovician period (485 million to 443 million years ago), once seen 
as off-the-wall, is now orthodoxy. There is nothing outré in principle 
in the supposition that life of some sort, however humble, lived on 
land from a very early date: the stately progression of life from water 
to land was not necessarily the single, simple narrative that seems so 
cut and dried in retrospect. The fossil record has this irritating habit: 
just when everyone thinks that the narrative has been sorted out, 
something comes along to force the story in a new, unexpected and 
breathtaking direction. m 


Physics, Nature Genetics, Nature Reviews Genetics, Nature Reviews 
Microbiology, Nature Communications and Scientific Reports. Also 
included are News articles and papers published online but not yet 
available in print. More journals will follow next year. 

At present, the accessibility of these diverse strands of content 
varies. Access for much of it is unrestricted. All of Nature’s jour- 
nalistic output is freely available. Scientific Reports is a ‘gold’ open- 
access journal — in other words, its papers are freely available, 
in their final published form, to anybody from the moment of 
publication. Nature Communications has about half of its content 
openly accessible. All of the rest is from now on available in the 
new iPad app, although so far only to personal print and iPad sub- 
scribers. We aim to provide access arrangements for readers who 
work at institutions with an online-only site licence by the middle 
of 2013. 

The journals iPad app has been designed to deliver a reading 
experience that has all the clarity and convenience of print while 
surpassing print in its various functions. Alongside the issue-based 
navigation and tables of content, the new version allows the follow- 
ing: sharing; accessible bookmarks; saved searches (synced with 
nature.com); downloading of PDF files of articles; variable font 
sizes; figures at a glance in a figure-viewing panel; downloading 
and offline access to the journals; active links for references; and 
enhanced navigation from within the article. 

We hope that those who value Nature will find its tablet 
edition even more stimulating and useful than the more traditional 
renditions. = 
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WORLD VIEW .jernisicos sen 


Bernard in An Introduction to the Study of Experimental Medi- 


cc iE is better to know nothing,’ wrote French physiologist Claude 


cine (1865), “than to keep in mind fixed ideas based on theories 
whose confirmation we constantly seek” 

Embracing a fixed idea is one of the main dangers in the evolution of 
any scientific discipline. Ideally, errors will be uncovered in the trial- 
by-fire of rigorous testing and the science will right itself: In rare cases, 
however, an entire discipline can be based on a fundamental flaw. 

Asa science journalist turned science historian, I have written at 
length about how and why this may have happened in obesity research. 
Ihave suggested that the discipline may be a house of cards — as, by 
extension, may much research into the chronic diseases associated 
with obesity, such as diabetes. 

Before the Second World War, European investigators believed that 
obesity was a hormonal or regulatory disorder. 
Gustav von Bergmann, a German authority on 
internal medicine, proposed this hypothesis in 
the early 1900s. 

The theory evaporated with the war. After 
the lingua franca of science switched from 
German to English, the German-language lit- 
erature on obesity was rarely cited. (Imagine the 
world today if physicists had chosen to ignore 
the thinking that emerged from Germany an 
Austria before the war.) 

Instead, physicians embraced the ideas of the 
University of Michigan physician Louis New- 
burgh, who argued that obese individuals had 
a “perverted appetite” that failed to match the 
calories that they consumed with their bodies’ 
metabolic needs. “All obese persons are alike in 
one fundamental respect,’ Newburgh insisted, “they literally overeat.” 
This paradigm of energy balance/overeating/gluttony/sloth became 
the conventional, unquestioned explanation for why we get fat. It is, 
as Bernard would say, the fixed idea. 

This history would be no more than an interesting footnote in 
obesity science if there were not compelling reason to believe that the 
overeating hypothesis has failed. In the United States, and elsewhere, 
obesity and diabetes rates have climbed to crisis levels in the time that 
Newburgh’s energy-balance idea has held sway, despite the ubiquity of 
the advice based on it: if we want to lose fat, we have to eat less and/or 
move more. Yet rather than blame the advice, we have taken to blaming 
individuals for not following it ‘properly. 

The alternative hypothesis — that obesity is a hormonal, regula- 
tory defect — leads to a different prescription. In 


this paradigm, it is not excess calories that cause NATURE.COM 
obesity, but the quantity and quality of carbo- _ Discuss this article 
hydrates consumed. The carbohydrate content _ online at: 

of the diet must be rectified to restore health. go.nature.com/3rt7vo 


THERE IS 
COMPELLING 
REASON 


TO BELIEVE THAT 
THE OVEREATING 
HYPOTHESIS HAS 


FAILED. 


Treat obesity as 
physiology, not physics 


The energy in-energy out hypothesis is not set in stone, argues Gary Taubes. 
Itis time to test hormonal theories about why we get fat. 


This conclusion is based on endocrinology that has been under- 
stood for 50 years: insulin regulates fat accumulation, and blood levels 
of insulin are effectively determined by carbohydrate intake. The more 
easily digestible are the carbohydrates we eat (the higher their glycae- 
mic index) and the sweeter they are (the higher their fructose content) 
the higher are our blood insulin levels, and the more fat accumulates. 

If this is true, it suggests that the obesity epidemic was caused at least 
in part by the research community’s failure to understand the nature 
of the disease, and by the food industry's exploitation of that failure. 

But is it true? Or is it the case, as conventional wisdom has it, that 
these competing hypotheses of obesity have been rigorously tested, 
and the energy-balance hypothesis has simply won out? 

Over the past year, with physician Peter Attia and support from 
the Laura and John Arnold Foundation of Houston, Texas, I have 
co-founded the non-profit Nutrition Science 
Initiative (NuSI) in San Diego, California, with 
the goal of resolving this controversy. 

Among our first tasks was to comb the medi- 
cal literature back to the 1930s, identifying all 
studies relevant to the question of whether car- 
bohydrates or excess calories cause obesity. We 
found much ambiguous smoke, but none of the 
fire of rigorous experimental evidence necessary 
to establish definitively the truth or falsehood of 
either hypothesis. This is unacceptable, consider- 
ing the critical public-health problem presented 
by obesity and diabetes. (The studies and our 
conclusions are available at http://nusi.org.) 

The trials share many shortcomings. One 
common flaw is true of all free-living diet trials: 
the investigators simply fail to control what the 
participants actually eat. The evidence suggests that few participants 
comply with the dietary advice, yet the researchers interpret the results 
as somehow speaking to the fundamental cause of obesity. It is as if we 
drew conclusions about whether smoking causes lung cancer on the 
basis of trials (poorly controlled ones, at that) of the efficacy of different 
methods of smoking cessation — nicotine patches, say, versus nicotine 
gum. This problem must be solved to establish reliable knowledge. 

NuSlaims to fund and facilitate the trials necessary to rigorously test 
the competing hypotheses, beginning with inpatient feeding studies 
that will rigidly control dietary interventions for participants so that 
we know unambiguously the effects of macronutrients — protein, fat 
and carbohydrates — on weight and body fat. These studies will be 
done by independent, sceptical researchers. This may be an idealistic 
dream, but we have committed ourselves to the effort. m 


Gary Taubes is a science writer, author of Why We Get Fat and 
founder of the Nutrition Science Initiative in San Diego, California. 
e-mail: taubes@gmail.com 
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Greenhouse gas 
finds ause 


A potent greenhouse gas that 
is a by-product of refrigerant 
production can be used to 
add a fluorine-based group 
to molecules — a desirable 
reaction in the manufacture of 
drugs and agrochemicals. 

G. K. Surya Prakash and his 
colleagues at the University 
of Southern California in 
Los Angeles report that the gas 
— which has the formula CF,;H 
and is known as fluoroform 
or HFC-23 — can be reacted 
with other molecules, 
resulting in the addition 
of CF, to carbon, silicon, 
boron or sulphur atoms. The 
reaction occurs under simple 
conditions and provides a 
rare use for a chemical with 
a global-warming potential 
11,700 times greater than that 
of carbon dioxide. 
Science 338, 1324-1327 (2012) 


Romani have 
Indian ancestry 


The 11 million members of 
Europe's largest minority 
group, the Romani (pictured), 
are descended from a single 
population that left India 
some 1,500 years ago and 
dispersed across Europe 
through the Balkans. 


Selections from the 
scientific literature 


Cigarette butts repel nest pests 


City-dwelling sparrows and finches incorporate 


the butts of smoked cigarettes into their nests, 
seemingly to ward off parasitic mites. 

Isabel Lopez-Rull and her colleagues at the 
National Autonomous University of Mexico 
in Mexico City found that nests of house 
sparrows (Passer domesticus) and house finches 
(Carpodacus mexicanus; pictured) with higher 
levels of cellulose acetate, a component of 
cigarette butts, had fewer mites. The researchers 


attached fibres from either smoked or unsmoked 


cigarette filters to parasite-attracting heat traps 
and placed them in 27 sparrow and 28 finch 


David Comas at Pompeu 
Fabra University in Barcelona, 
Spain, Manfred Kayser 
at Erasmus University in 
Rotterdam, the Netherlands, 
and their colleagues analysed 


(2012) 
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before beginning to spread 
throughout Europe around 
900 years ago. Since then, 
Romani have interbred with 
local populations in Europe. 
Curr. Biol. http://dx.doi.org/ 
10.1016/j.cub.2012.10.039 


nests. Traps bearing fibres from smoked filters, 

which contain more nicotine than those 

of unsmoked ones, captured fewer mites, 

suggesting that nicotine — and perhaps other 

compounds in cigarettes — repel the parasites. 
Birds have long been known to line their nests 

with vegetation that deters parasites, and the 

authors suggest that the use of cigarette butts is 

an urbanized form of this earlier adaptation. 

Biol. Lett. http://dx.doi.org/10.1098/rsbl.2012.0931 

(2012) 

For a longer story on this research, 

see go.nature.com/ygjvtn 


the genomes of 152 Romani 
individuals from across 
Europe and compared them 
with those of populations 
worldwide. European Romani 
probably originated from 
northern and northwestern 


Early signs of 
embryo trouble 


Human embryos with genetic 
defects may exhibit unusual 


India. cellular behaviour by the 
Genetic analysis suggests time they reach the four-cell 

that, after leaving India, stage, about 48 hours after 

Romani ancestors interbred fertilization. 

with local populations Renee Reijo Pera at Stanford 

on the way to the Balkans University in California and 


her colleagues analysed the 
genetics of 45 single-celled 
embryos left over from in vitro 
fertilization procedures. 
About 75% of the embryos 
contained an abnormal 


two carrying three copies of 


number of chromosomes, with 
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chromosome 21, which leads 
to Down syndrome. Time- 
lapse imaging revealed that 
normal embryos underwent 
precisely timed cell divisions, 
whereas 70% of embryos with 
abnormal chromosome counts 
did not. The researchers also 
observed that cells in abnormal 
embryos often fragmented 
into tiny pieces containing 
chromosomes (pictured). 
Automated image tracking of 
cell division and fragmentation 
could provide a non-invasive 
tool to screen embryos for 
healthy in vitro fertilization 
candidates, the authors say. 
Nature Commun. 3, 1251 (2012) 


QUANTUM PHYSICS 


Atom cooled to 
ground state 


Using a tightly focused beam 
of light as optical tweezers, 
researchers have confined 

a single neutral atom for 
long enough to cool it to 

its lowest-energy quantum 
state, a requirement for 
many quantum-computing 
applications. 

Researchers have previously 
cooled charged atoms ina 
similar way, but uncharged 
atoms may be more appealing 
for use in quantum devices 
because they do not interact 
with electric fields. Cindy 
Regal and her collaborators at 
JILA, a joint research institute 
of the US National Institute 
of Standards and Technology 
and the University of Colorado 
at Boulder, first used optical 
tweezers to trap an individual 
rubidium atom. Then, using 
a laser-based technique called 
Raman sideband cooling, they 
cooled the atom to its near- 
motionless ground state. 

Phys. Rev. X 2,041014 (2012) 


CLIMATE CHANGE 


Rain shifts bear 
human fingerprint 


Summer rainfall patterns in 
the Southern Hemisphere 
have changed markedly in 
response to rising greenhouse- 
gas concentrations and ozone- 
layer depletion, both caused by 
human activity. 

Since the 1960s, southern 
regions at mid-latitudes 
have become drier whereas a 
zone around Antarctica has 
grown wetter. John Fyfe at the 
Canadian Centre for Climate 
Modelling and Analysis in 
Victoria, British Columbia, 
and his colleagues found that 
the observed trends agree 
with predicted precipitation 
patterns obtained from a set of 
29 climate models. 

They found that greenhouse 
gases and ozone changes were 
primarily responsible for the 
shifts in precipitation levels. 
Moreover, natural climate 
variability cannot explain the 
observed and modelled trends, 
the team found. 

Geophys. Res. Lett. http://dx.doi. 
org/10.1029/2012GL054199 
(2012) 


Surprises beneath 
Moon’s surface 


The crust beneath the Moon's 
heavily battered surface is 
almost entirely pulverized, 
indicating that it took an 
even greater beating from 
space debris during the Solar 
System's first billion years 
than planetary scientists 
suspected. 

Maria Zuber at the 
Massachusetts Institute of 
Technology in Cambridge 
and her colleagues report 
these and other findings 
from NASA's twin lunar- 
orbiting spacecraft, known as 
GRAIL, in a trio of articles. 
The twin craft probe the 
Moon's interior by mapping 
the lunar gravitational 
field. GRAIL data have also 
revealed that the Moon's 
average crust is considerably 
thinner than previously 
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The most viewed 


CHOICE 


papers in science 


Hormone may aid monogamy 


3 HIGHLY READ 
on jneurosci.org 
in November 


Men in monogamous relationships keep 
their distance from good-looking women 
after receiving nasal puffs of oxytocin — 


the human hormone that has been linked 
to romantic attraction — whereas single men do not. 

René Hurlemann at the University of Bonn, Germany, and 
his colleagues recruited 86 heterosexual men who were either 
single or in stable monogamous relationships. The volunteers 
were asked to choose how close to stand to female or male 
experimenters, or to perform a similar task using photographs. 
Compared with single men and partnered men who had 
received a placebo, partnered men given oxytocin stood 
10-15 centimetres farther away from women they deemed 
attractive, and approached pictures of attractive women more 
slowly. No effect was seen with the male experimenters. 

The authors suggest that oxytocin could help to maintain 


fidelity in romantic relationships. 


J. Neurosci. 32, 16074 -16079 (2012) 


estimated — measuring 
between 34 and 43 kilometres 
— suggesting that some of the 
fracturing caused by space 
debris could have penetrated 
the full depth of the crust and 
into the mantle. Such deep 
fracturing, which would also 
be expected on Earth and 
Mars, could have created. 
porous structures that held hot 
groundwater for long periods, 
providing possible niches for 
life in the early Solar System. 
Science http://dx.doi. 
org/10.1126/science.1231507; 
http://dx.doi.org/10.1126/ 
science.1231530; http://dx.doi. 
org/10.1126/science.1231753 
(2012) 


MATERIALS 


Foams for cell- 
friendly scaffolds 


Porous foams with 
nanometre-scale patches to 
which cells and proteins can 
adhere could one day be used 
in tissue engineering. 

A group led by Giuseppe 
Battaglia at the University 
of Sheffield, UK, and Adam 
Engler at the University of 
California, San Diego, used 
an established technique to 
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create porous polystyrene- 
based foams (pictured). By 
incorporating various other 
polymers into their foams, 
the researchers controlled the 
surface topology of the pores 
and thus the distribution and 
size of the places where cells 
and proteins could attach over 
three dimensions. 

The technique could 
allow bioengineers to create 
self-assembling scaffolds that 
control, for example, where 
stem cells adhere. 
J. Am. Chem. Soc. http://dx.doi. 
org/10.1021/ja308523¢f (2012) 
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SEVEN DAYS nese 


Next US Mars rover 
NASA will land another rover 
on Mars in 2020 at a cost of 
US$1.5 billion, said John 
Grunsfeld, the agency's science 
chief, on 4 December. Earlier 
this year, NASA pulled out of 
the European Space Agency’s 
2016-18 ExoMars mission, 
which included a NASA rover 
intended to cache rocks before 
their eventual return to Earth, 
alongside a European life- 
detection rover. See go.nature. 
com/k4cm8b for more. 


Facilities cash 


UK chancellor George 
Osborne is investing an extra 
£600 million (US$967 million) 
in UK science. The cash, 
announced on 5 December, 
will go partly to academic 
researchers for infrastructure 
projects, and partly to 
industrial research facilities. 
The UK government has 
now announced more than 
£1.5 billion of new science 
funding over the past two 
years, but the money has yet 
to fully make up for drastic 
infrastructure funding cuts 
made in 2010. See go.nature. 
com/dw9kjn for more. 


Arctic report card 


An annual environmental 
assessment of the far north has 
revealed that the Arctic broke 
a string of environmental 
records in 2012, including 

the lowest summer extent 

of sea ice. The warming of 

the Arctic will have global 
repercussions, says the 

US National Oceanic and 
Atmospheric Administration, 
which released the report on 
5 December. See go.nature. 
com/kyxdxd for more. 


Embryo research 


A government-backed bill to 
relax rules on research using 
embryonic stem cells and 
embryos was passed by the 


NASA snaps Earth at night 


Almost exactly 40 years after Apollo 17 
astronauts took the iconic Blue Marble 
photograph of Earth, NASA has released 

a spectacular view of the planet at night, 
appropriately named Black Marble. The views, 
unveiled on 5 December, were stitched together 


French Senate last week. Such 
research is currently banned 
in France, although a 2004 law 
giving scientists dispensation 
for research that promises 
“major therapeutic progress” 
against serious diseases has 
allowed some work. The 

bill, supported by President 
Francois Hollande (see Nature 
484, 298-299; 2012), will go to 
a vote in the national assembly 
next year, before returning 

for a second reading in both 
chambers. 


Stem-cell drive 


A coalition of 10 drug 
companies and 23 academic 
institutions announced a 

plan on 5 December to create 
1,500 new induced pluripotent 
stem-cell lines. With 

€26 million (US$34 million) 
from the European Union 
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and €21 million of ‘in-kind 
contributions from the private 
sector, the StemBANCC 
project will derive cell lines 
from 500 patients and make 
them available to other 
researchers. See go.nature. 
com/ohez8u for more. 


Physics awards 
Seven scientists will share a 
US$3-million prize for their 
key roles at the Large Hadron 
Collider at CERN, Europe's 
particle-physics lab near 
Geneva, Switzerland, and UK 
theoretical physicist Stephen 
Hawking will receive the 
same amount for his insight 
that black holes should emit 
Hawking radiation. The awards 
were made on 11 December 
by the Fundamental Physics 


from images taken by a high-resolution sensor 
aboard the Suomi National Polar-orbiting 
Partnership weather satellite. Data from the 
new sensor could be used as a proxy for factors 
such as population distribution and carbon 
emissions. See go.nature.com/jfv8ql for more. 


Prize Foundation, a non-profit 
organization founded by 
Internet investor Yuri Milner. 
They follow nine $3-million 
awards to physicists made by 
the foundation in August. See 
go.nature.com/jzkdcj for more. 


Cancer sequencing 
Up to 100,000 patients with 
cancer and rare diseases 

in England will have their 
DNA mapped under plans 

to integrate whole-genome 
sequencing into the United 
Kingdom's National Health 
Service. In an announcement 
on Monday, Prime Minister 
David Cameron said that 
£100 million (US$160 million) 
had been earmarked for the 
project, which includes wider 
plans to boost genetic science 


NASA EARTH OBSERVATORY/NOAA NGDC 


LIQUID ROBOTICS 


SOURCE: FDA 


and build an infrastructure to 
feed data back to researchers. 
See go.nature.com/myvpxs for 
more. 


Big-data bonanza 
The US National Institutes 

of Health will spend about 
US$125 million a year for up to 
seven years on training and on 
upgrading its ability to analyse, 
optimize and share the gigantic 
data sets being generated 

in biomedicine, the agency 
announced on 7 December. 
From 2014, such ‘big data’ 
awards will be available mostly 
to grant applicants, but also to 
the agency’s staff. 


Desert science 


Qatar has announced plans 
to set up a climate-research 
institute in Doha. A key 
focus of the institute will be 
the effects of climate change 
on dry regions around the 
world, officials from the 
Qatar Foundation said on 

5 December, at the sidelines 
of international climate talks 
in the capital. A research 
plan and budget figures are 
to be announced once a 
founding committee has been 
established. 


Robot sailor 


A wave-powered robot 

named Papa Mau has broken 
the record for the longest 
distance travelled by an 
autonomous vehicle, its maker 
announced on 6 December. 


TREND WATCH 


The US Food and Drug 


Administration (FDA) approved 


35 new drugs in the fiscal year 
2012, the same number as in 
2011 (see chart). Half of the 


drugs were authorized through 


programmes designed to 
accelerate the process — for 
example, in cases of serious 


unmet medical need involving 


life-threatening diseases. The 


class of 2012 includes nine cancer 
drugs, nine treatments for rare, 


or ‘orphan; diseases and the 


first approved drug made from 


human umbilical-cord blood. 


The robot is one of four 


“Wave Gliders’ (pictured) 
launched by Liquid Robotics 
in Sunnyvale, California, to 
gather oceanographic data. It 
took over a year to cover the 
more than 16,000 kilometres 
between California and 
Australia. See go.nature.com/ 
jo8jsz for more. 


Drug sales concern 
On 3 December, a US 

federal appeals court 
overthrew the conviction of 
a pharmaceuticals salesman 
who had promoted drugs 

for uses that had not been 
approved by the Food and 
Drug Administration, ruling 
that a long-standing ban on 
drug marketing violated free 
speech. The pharmaceutical 
industry has already paid 
billions of dollars in penalties 
for marketing drugs this way. 
If upheld by higher courts, 
the ruling could make it 
more difficult for the US 


government to prosecute those 
who promote such ‘off-label’ 
uses of drugs. 


Genetics buyout 


Biotechnology giant Amgen 
has announced plans to 
acquire deCODE Genetics for 
US$415 million. deCODE, 
based in Reykjavik, struggled 
through bankruptcy in 2009. 
However, it has identified 

a wealth of genetic variants 
associated with conditions 
from Alzheimer’s disease to 
schizophrenia. Amgen chief 
executive Robert Bradway 
said that the acquisition would 
allow his company, based in 
Thousand Oaks, California, 
to produce more-innovative 
drugs. 


Whistleblower deal 
AUS scientific-integrity 
official who says he was fired 
for raising scientific concerns 
about a dam project (see 
Nature 484, 15; 2012) has 
reached a settlement with the 
government, a watchdog group 
representing him announced 
on 4 December. Paul Houser, a 
hydrologist at George Mason 
University in Fairfax, Virginia, 
was employed by the US 
Department of the Interior’s 
Bureau of Reclamation but 
lost his government job 

in February. He launched 

a complaint under the 
Whistleblower Protection Act, 
arare example of a scientist 
appealing to the act to protect 


US DRUG APPROVALS HOLD STEADY 


The US Food and Drug Agency approved 35 new drugs 


in fiscal year 2012. 
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SEVEN DAYS | THIS WEEK | 


15-19 DECEMBER 
The American Society 
for Cell Biology meets in 
San Francisco — with an 
address from US energy 
secretary Steven Chu on 
how physical scientists 
are changing biomedical 
science (see Nature 474, 
20-22; 2011). 
go.nature.com/xkzgw8 


17-18 DECEMBER 
With research on 
modifying avian HSN1 
flu viruses still under a 
moratorium, scientists 
and policy-makers 
meet at the US National 
Institutes of Health in 
Bethesda, Maryland, to 
discuss conditions under 
which experiments 
might restart. 
go.nature.com/Shvvzw 


his scientific disclosures. In a 
joint statement, both sides said 
they had reached agreement 
through mediation and would 
not be disclosing details of the 
settlement. 


Vaccine landmark 
A cheap vaccine against 
meningitis A last week 
reached the landmark of 
immunizing more than 100 
million people in Africa’s 
‘meningitis belt — a region 
in the north of the continent 
where the bacterial disease 
periodically kills thousands 
in intense epidemics. In 
Burkina Faso, Mali and 
Niger, where MenAfriVac 
was first rolled out in 2010 
(see Nature 468, 143; 2010), 
no cases of meningitis A had 
been detected up to October 
this year. The vaccine has 
now been launched in seven 
other countries. The project 
is led by the World Health 
Organization and PATH, 

a non-profit body based in 
Seattle, Washington. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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Forty years after Apollo 17, the last mission to send astronauts to the lunar surface, the Moon still beckons the US space agency. 


| SPACE EXPLORATION | EXPLORATION 


Duelling visions stall NASA 


A US plan to send humans to explore an asteroid is losing momentum. 


BY ERIC HAND 


nce again, NASA’s human space- 
()ite programme is looking for a 

destination. It happened in the early 
1970s, after US astronauts had left the Moon 
for the last time; then in the 1990s, after the 
collapse of a costly vision of sending astro- 
nauts to Mars; and again in 2010, when US 
President Barack Obama abandoned a plan 
to return humans to the Moon because he 
did not consider it ambitious enough. He 
suggested visiting a near-Earth asteroid 


instead, but a report released on 5 December 
by the National Academies says that this plan, 
too, has misfired. 

“There is no broad acceptance of the 
asteroid as the next principal destination 
for space flight, despite the fact that the presi- 
dent has indeed said so several times,” says 

Albert Carnesale, chair- 
man of the committee 


Readmore aboutthe behind the report and 
impactoftheApollo a former chancellor of 
programme: the University of Cali- 


fornia, Los Angeles. 


For its part, NASA — whether through iner- 
tia or out of practicality — seems unwilling to 
shift the focus of its human space-flight efforts 
away from the Moon. 

Part of the tug of war over destinations is 
political. The administration’s choice of an 
asteroid is a volte-face from the ‘Moon-first’ 
doctrine espoused during the presidency of 
George W. Bush. By many accounts, Obama 
had not garnered much support for the new 
policy before springing it on NASA and Con- 
gress nearly three years ago. NASA’s admin- 
istrator, Charles Bolden, has been left to 
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MOONSTRUCK 


Why scientists yearn for the far side 


Some 60,000 kilometres beyond the far 

side of the Moon lies a spot known as ‘L2’, a 
Lagrangian point at which gravitational forces 
are balanced, so that a spacecraft could sit 
there without expending much 
fuel. The next time US astronauts 
leave low Earth orbit, L2 might just 
be their next destination. 

On 28 November, William 
Gerstenmaier, associate 
administrator for NASA’s human- 
exploration division, told the NASA 
Advisory Council that a mission 
to L2 or another Lagrangian point 
could be an early test for NASA's 
planned crewed rockets. Such 
locations could serve as staging 
grounds for other missions, he 
says: “They’re nice places to put 
cargo or samples from the Moon 
or Mars.” Gerstenmaier reckons 
that an uncrewed mission could 
get to L2 by 2017, to be followed 
by a crewed mission in 2021, but 
he is careful not to specify what activities the 
astronauts would pursue. 

David Kring wants those activities to 
include lunar science. Kring, a geologist at 
the Lunar and Planetary Institute in Houston, 
Texas, has for years been among the chorus 
of lunar scientists calling for a sample to be 
retrieved from the vast South Pole-Aitken 
basin, one of the largest impact craters in the 
Solar System (see Nature 453, 1160-1163; 
2008). Getting a firm date for this far-side 


> negotiate the rocky ground between a 
suspicious Congress and an administration 
that has largely neglected the space agency. 
Some say that Bolden — who was not the 
administration’s first choice when he was 
appointed in 2009 — may face replacement as 
Obama heads into his second term. “It’s not a 
happy situation,” says a senior astrophysicist. 
“Names are being discussed.” 

Politics aside, the asteroid proposal 
remains short on details and long on technical 
challenges, including the problem of which 
asteroid to visit. Few asteroids would be close 
enough in the mid-2020s for a crew to reach 
them within a year-long round trip, the longest 
that many experts think practical, and NASA 
has not yet made a serious effort to priori- 
tize possible targets. If the administration is 
serious about a rendezvous with an asteroid, 
Carnesale says, then “which asteroid do you 
have in mind, and when?”. 

Even if one is selected, some scientists are 
worried about the hazards of approaching 
an object that may be no more than a loosely 


adventure would help to set the time for a 
barrage of asteroids that theorists believe 
pummelled the inner Solar System about 
3.9 billion years ago. Although Kring does 


A foil-like antenna could detect radio waves from the distant Universe. 


not expect funding for a human landing, 

he thinks that it could be possible for the 
2021 mission to fly a small robotic lander 
carrying a rover capable of collecting 
samples for later retrieval. Astronauts 
circling overhead at the L2 point could tele- 
robotically operate the rover with a joystick — 
an activity that many think will be necessary 
practice for any future operations at Mars. 
“The lunar far side is terra incognita,” says 
Kring. “We've never been there.” 


bound pile of rubble — that could also be 
spinning dangerously fast. And how would 
astronauts anchor themselves to the poten- 
tially friable surface without kicking up dust? 
asks Stephen Mackwell, director of the Lunar 
and Planetary Institute in Houston, Texas. 
“We have problems enough at the Moon 
with dust.” 

Then there is the problem of just getting 
there. NASA is increasingly concerned about 
the radiation exposure and bone loss that 
astronauts might face during a long voyage 
outside Earth’s protective magnetosphere. 
“You get a bad solar storm and youre toast,” 
says Mackwell. 

Meanwhile, Carnesale notes that private 
companies and other nations are mobilizing 
to visit the Moon. And NASA itself has not 
written off the Moon either. On 28 November, 
William Gerstenmaier, associate administra- 
tor for NASA’s human-exploration division, 
presented plans to test the agency’s next 
generation of heavy-lift rockets with trips to 
the vicinity of the Moon in 2017 and 2021. 
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Jack Burns, an astrophysicist at the 
University of Colorado at Boulder, wants to 
use the same rover to lay down a thin, foil-like 
antenna that would receive low-frequency 
radio waves from the distant 
Universe. He says that the lunar 
far side, shielded from Earth 
noise, is the ideal site for an array 
of such antennas, which could 
search for radio signals from the 
hydrogen atoms that formed soon 
after the Big Bang. The signals 
would be similar to the ubiquitous 
21-centimetre emission that 
originates from hydrogen in 
the Milky Way, but shifted to 
much longer wavelengths by the 
expansion of the Universe. Burns 
would look for a change in the 
hydrogen signals that would mark 
the moment the first stars turned 
on, and indicate how quickly 
their ultraviolet light ionized the 
hydrogen that makes up the 
bulk of intergalactic space today. “We’re 
reaching into territory that will be difficult, if 
not impossible, for James Webb,” he says, 
referring to NASA's 6.5-metre infrared space 
telescope, which is scheduled to launch 
in 2018. 

Stephen Mackwell, director of the Lunar 
and Planetary Institute, says that such 
ideas show how the Moon is still relevant for 
scientific inquiry. “I get ticked off when people 
say, ‘Been there, done that’.” E.H. 


Although those missions could be seen as step- 
ping stones to a later asteroid mission, many 
lunar scientists view the region as a destination 
in its own right (see ‘Moonstruck’). 

But Mark Sykes, president of the Planetary 
Science Institute in Tucson, Arizona, and chair 
of NASA’ Small Bodies Assessment Group, 
remains a big fan of asteroids. He notes that 
human explorers could search for resources 
such as water. Scientists could seek to under- 
stand the subtle pressure of light that causes 
asteroids to change their spin, and could 
retrieve samples for dating and chemical anal- 
ysis that would offer a clearer picture of Solar 
System material than do meteorites, which, 
although they are pieces of asteroids, are altered 
during their fall through Earth’s atmosphere. 

Butall this could be done more cheaply with 
a robotic mission, says Sykes. Without a sus- 
tained drive towards something bigger — such 
as a human presence on Mars — even Sykes 
isn't terribly excited. “You go to an asteroid, 
then what?” he says. “If it’s all performance 
art, that’s not much of a mission.” 
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Ancient fungi found 
in deep-sea mud 


Discovery raises hopes that sea floor could yield previously unknown antibiotics. 


BY RICHARD MONASTERSKY 


esearchers have found evidence of 
R= thriving far below the floor of the 

Pacific Ocean, in nutrient-starved sedi- 
ments more than 100 million years old. The 
discovery has the potential to turn the brown 
muck of the sea floor into pure gold for biolo- 
gists looking for alternative forms of life — and 
possibly for pharmaceutical companies seek- 
ing antibiotics to combat the growing problem 
of drug-resistant bacteria. 

“This is adding a new family of potential 
drugs,’ says Brandi Reese, a biogeochemist at 
the University of Southern California in Los 
Angeles who studied the fungi, some of which 
belong to the genus Penicillium, the source of 
penicillin. Finding multicellular organisms in 
such a deprived environment “extends what we 
understand about the limits of life on the planet”, 
says Heath Mills, a molecular geomicrobiologist 
at Texas A&M University in College Station, 
who collaborated with Reese. They reported 
their results on 6 December at a meeting of the 
American Geophysical Union (AGU) in San 
Francisco, California. 

To follow up on earlier reports of deep-sea 
fungi, Reese and her colleagues studied sedi- 
ments pulled up from as deep as 127 metres 
below the sea floor during an expedition of the 
Integrated Ocean Drilling Program in the South 
Pacific in 2010. They searched the samples for 
fungal genetic material and found sequences 
from at least eight groups. The team succeeded 
in growing cultures of four of the fungi. 

A decade ago, the only organisms known 
to live in deep layers of sediment were single- 
celled — bacteria and archaea. Hints of fungi 
in sediments started emerging in 2005, but just 
a handful of researchers are studying those 
sediment dwellers and the other fungi living 


> 


MORE 
ONLINE 


in ocean water. “These things are ubiquitous 
and they’re largely ignored,” says Jennifer 
Biddle, a microbial ecologist at the University 
of Delaware in Lewes, who was among the 
first to culture fungi from sea-floor sediments 
(J. FE. Biddle et al. Geobiology 3, 287-295; 2005). 

Some biologists have been sceptical about 


°F ae: 


Penicillium fungi have been grown from sediments. 


such reports, suggesting that the material might 
be the result of contamination — or inactive 
spores of surface fungi that became trapped in 
the muck. But Reese and her colleagues say that 
they took several steps to rule out contamina- 
tion. Steven D'Hondt, who studies microbes in 
sediments at the University of Rhode Island in 
Narragansett, says that Reese’s group and others 
are “accumulating a lot of evidence that there 
are fungi in the deep sediments”. 

And the material seems to be more than 
spores, says William Orsi, a molecular 


ecologist at the Woods Hole Oceanographic 
Institution in Massachusetts. Working with 
Biddle and Virginia Edgcomb, a microbiologist 
at Woods Hole, Orsi has studied deep-sea sedi- 
ments from off the coast of Peru and detected 
bits of fungal messenger RNA that code for 
several proteins, including those involved in 
transporting ions and metals across mem- 
branes. This shows that the subsurface fungi 
are metabolically active, says Orsi, who pre- 
sented the work at the AGU meeting. 

Now that the evidence in favour of fungi in 
deep sediments is accumulating, “the press- 
ing question is: what are they doing?” says 
Orsi. The sediments studied by Reese and her 
colleagues underlie the South Pacific Gyre — 
“One of the most dead places on the planet,” 
says Mills. The gyre is far from land, so few 
nutrients get there. Marine life is scarce and 
microbes in the surface sediments devour the 
scant organic matter that sinks to the bottom. 

Mills thinks that the fungi might have a key 
role in the nutrient-starved deep ecosystem. 
Researchers have previously presumed that the 
organic matter left in sediments after millions 
of years is too difficult for most single-celled 
organisms to consume. But fungi are masters at 
breaking down tough organic molecules, says 
Mills, so they could be providing sources of 
food for microbes far below the sea floor. 

The team says that it is not clear whether 
the fungi in the deepest sediments are more 
than 100 million years old; they might have 
colonized those layers by moving in from 
younger deposits. But if the fungi have 
been isolated for a long time, they may have 
evolved unusual biological defences against 
bacteria and could provide a source of useful 
antibiotics. “What if this is a new version of 
penicillin?” asks Mills. “That’s one of the ben- 
efits of going to the deep biosphere.” = 


| MORE NEWS | | VIDEO | 
Porcupine @ Brain cells made from urine go.nature. Airborne 
quills’ com/de36vx observatory 
vicious @ James Cameron reveals some of reveals 
secret the results from his deep dive go.nature. drought 
revealed com/aihhdy damage 
go.nature.com/ | @ Pharmaceutical lab loses valuable go.nature.com/ 
tyGd4f gold stash go.nature.com/hycw71 hbfgno 
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Despite being insulators inside, samarium hexaboride crystals can conduct electricity on their surface. 


CONDENSED-MATTER PHYSICS 


Hopes surface for 
exotic insulator 


Findings by three teams may solve a 40-year-old mystery. 


BY EUGENIE SAMUEL REICH 


compound whose odd electrical 

behaviour has puzzled physicists for 

decades could turn out to be a boon 
for quantum physics and electronic-device 
makers. 

When theorists proposed in 2005 that it 
should be possible to find materials that con- 
duct electricity at the surface while the rest 
of the sample behaves as an insulator, physi- 
cists were intrigued. They wanted to study 
the quantum effects that should emerge in 
such materials, and to explore applications in 
low-power electronics and quantum comput- 
ing. But topological insulators, as the materi- 
als were called, proved fiendishly difficult to 
make. Some researchers have slaved to pro- 
duce thin films using complex techniques 
that are unlikely ever to scale up to the levels 
needed for industrial purposes. Others have 
contented themselves with compounds that 
approximate topological insulators but still have 
a degree of internal conductivity. 

Now, three papers” suggest that samarium 
hexaboride, a poorly understood compound 
that was first found to gain conducting prop- 
erties at very low temperatures* in 1969 by 
researchers at Bell Labs in New Jersey, may in 
fact be a topological insulator in its bulk form. 

In the most recent paper’ in the trend, 
posted online on 28 November, researchers 


at the University of California, Irvine, report 
seeing remarkably fast-moving electrons on 
the surface of SmB, crystals, which they take 
as a sign of a superb surface conductor. Five 
days earlier, researchers at the University of 
Maryland in College Park had reported meas- 
urements tracing the path of electrons injected 
into SmB, samples as they were cooled’. Those 
results suggest that the material is insulat- 
ing in its interior at temperatures below 
around 30 kelvin. And, in a paper posted on 
21 November’, scientists from the University 
of Michigan in Ann Arbor and University of 
California, Irvine, describe their measure- 
ments of conductivity through the surface and 
bulk of the material, and find evidence that the 


CHARGING UP 


The number of papers published on topological 
insulators has grown rapidly over the past few years. 
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surface conducting behaviour persists despite 
imperfections and impurities, as would be 
expected from a true topological insulator. 

A spurt of interest in topological insulators 
over the past few years (see ‘Charging up’) led 
to a 2010 prediction that SmB, would be such 
a material’. “I'd say we've been tentatively vin- 
dicated,” says Piers Coleman of Rutgers Uni- 
versity in Piscataway, New Jersey, one of the 
four theoretical physicists who made the pre- 
diction. “We're thrilled by these new results.” 


COOL CHARACTERISTICS 

The prediction grew, in part, from studies of 
materials known as Kondo insulators, which, 
unlike ordinary insulators, retain some of the 
small amount of conductivity they do have 
when they are cooled to a few degrees above 
absolute zero. SmB,, which is often categorized 
as a Kondo insulator, fits this description. 

Coleman and other theorists realized that 
the material’s behaviour would make sense 
if it were a topological insulator. That would 
mean that the quantum properties of the 
material would be such that electrons can- 
not flow through it freely, as they would in 
an ordinary conductor, except at the mate- 
rial’s surface. If this proves correct, Coleman 
thinks that insights gleaned from SmB, and 
other Kondo insulators could carry over to 
all topological insulators. 

SmB, is an unusual topological insulator 
because the electrons in the outer shells of the 
samarium atoms interact with one another 
strongly, such that a coordinated motion 
emerges. This could make the material use- 
ful for creating some exotic quantum effects, 
including magnetic monopoles, or Majorana 
fermions — quasiparticles that might be use- 
ful for quantum computing, says Shoucheng 
Zhang, who has pioneered work on topological 
insulators at Stanford University in California. 
Zhang adds that the rush of interest in SmB, is 
part ofa trend to study materials with electrons 
that interact strongly with each other. “Now 
were looking at a number of systems. It’s a very 
exciting development,’ he says. 

Peter Armitage, who has been working on 
topological insulating behaviour in bismuth- 
based compounds at John Hopkins University 
in Baltimore, Maryland, says that in the field 
of condensed-matter physics, experiment usu- 
ally leads theory, but this is a remarkable exam- 
ple of the opposite. He is now hoping to start 
experiments on SmB, in the next week or two 
to confirm and study the surface states. “These 
are beautiful effects that were hiding under our 
noses,’ he says. “This is a very big advance.’ = 
1. Botimer, J. et al. Preprint at http://arxiv.org/ 
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Anew coronavirus found in the Middle East is one of a family of viruses named after the corona-like appearance of their surface spikes. 


Clusters of coronavirus 
cases put scientists on alert 


Surveillance ramped up after novel virus is identified in three Middle Eastern countries. 


BY DECLAN BUTLER 


Middle East a simmering threat to global 

public health or a viral footnote that will 
end up being of interest only to academics? 
That is the key question that researchers and 
public-health officials are now struggling to 
resolve. Two unsettling developments have 
fed their concerns: fresh hints that the virus 
might show at least limited spread from per- 
son to person, and signs that it has extended 
its geographical range. 

Last week, the World Health Organization 
(WHO) urged that surveillance for the virus, 
called human betacoronavirus 2c EMC/2012, 
should be extended to all countries worldwide, 
with a special focus on all clusters of severe 
pneumonia, particularly in health workers. 
Epidemiologists say that the nebulous threat 
requires close monitoring, by investigating and 
controlling any new clusters of human cases that 
could signal that the virus has adapted to spread 
between people more easily. 

The coronavirus, first reported on 
20 September, causes severe pneumonia and 
often kidney failure. In the nine cases con- 
firmed so far, it has had a death rate of more 
than 50%. “It’s an extremely serious disease; 


I s the coronavirus that has appeared in the 


it’s very much in the category of H5N1 [avian 
influenza],” says David Heymann, chairman 
of the UK Health Protection Agency and for- 
mer head of the communicable-diseases pro- 
gramme at the WHO during the 2003 SARS 
epidemic, caused bya different coronavirus. Yet 
this latest coronavirus is still limited to the Mid- 
dle East and shows no sign of spreading easily 
between people, Heymann emphasizes. 

In late November, 


“It’sanextremely however, the WHO 
serious disease; reported a house- 
it’s very muchin hold cluster of four 
the category of cases in Saudi Ara- 
HSNI1.” bia, in which two 


people died. Retro- 
spective tests looking for the coronavirus in 
samples from a cluster of 11 people who were 
admitted with serious respiratory symptoms 
to a hospital in Jordan in April have revealed 
the virus in another two fatal cases. This pushes 
back the first known date of its emergence by 
several months and marks the first evidence for 
infection outside Saudi Arabia and Qatar. 
Although independent infections from the 
same animal or environmental source might 
explain these outbreaks, such clusters always 
raise the possibility of transmission between 
humans. Because the disease did not spread 
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any further, however, person-to-person 
transmission of the virus — if it occurred at 
all — would have had to have been through 
close contact. Still, the extension of the 
disease to Jordan, and the identification of these 
earlier cases, marks a significant epidemiologi- 
cal development and suggests that infections 
may have gone undetected in other countries. 
What is needed now, says Heymann, is 
“good old shoe-leather epidemiology; that’s 
what worked in SARS”. Epidemiologists need 
to talk to the families and neighbours of any 
infected people; try to track down everyone 
who has come into contact with them to look 
for symptoms and test for the virus; and isolate 
infected people to try to stop any spread. 
Researchers including Christian Drosten, 
director of the Institute of Virology at the Uni- 
versity of Bonn Medical Centre in Germany, 
have moved quickly to develop diagnostic tests 
to determine whether people or samples are 
infected with the coronavirus’. These tests are 
now being rolled out by many national health 
authorities, and Drosten says that a survey in 
Europe is showing a par- 


> NATURE.COM ticularly quick response. 
For a coronavirus Teams from the WHO 
case timeline, see: and the US Centers for 
go.nature.com/xtr5wj Disease Control and 
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Prevention in Atlanta, Georgia, have been 
helping countries in the Middle East to boost 
surveillance. During the annual hajj religious 
pilgrimage in Saudi Arabia in late October, 
when millions of people visited the country 
before returning home — a potential recipe 
for global virus spread — both Saudi Arabia 
and return countries were on high alert. 

Beyond immediate surveillance efforts, 
increased monitoring for coronavirus anti- 
bodies in the wider population should help 
to pin down the true death rate. The current 
high death rate would fall, for example, if 
many people in the vicinity of cases turn out 
to have antibodies to the virus but have not 
developed serious disease. 

Meanwhile, laboratory studies are 
shedding light on the virus itself. This 
week, a team including Drosten has shown 
that this coronavirus seems to differ from 
the SARS virus in some important ways’. 
Because the SARS virus binds to the ACE-2 
receptor on human cells deep in the lungs, 
it causes serious disease in the lower respira- 
tory tract but is relatively difficult to con- 
tract and is not easily spread by coughing or 
sneezing. The researchers have determined 
that the new coronavirus does not bind to 
the ACE-2 receptor, but declined to elabo- 
rate on which receptor it does use, pending 
a separate publication. But ruling out the 
ACE-2 receptor has immediate practical 
implications — all the reagents and thera- 
peutic strategies developed for the SARS 
virus will be of little use with this latest 
virus, says Drosten. 

The same team also found that cells from 
bats, pigs and humans can all be infected in 
the lab with the coronavirus. This suggests 
that it is “promiscuous among mammals’, 
says Drosten, and that it might jump read- 
ily between mammal species, although he 
cautions against extrapolating such in vitro 
studies to the real world. 

More clues have come from Ian Lipkin, 
a renowned ‘virus hunter’ from Columbia 
University in New York, who was invited 
by the Saudi government in October to 
study the first cases there. He told Nature 
that he has discovered that partial genetic 
sequences of a virus isolated from bats 
match the coronavirus found in humans. 
“The finding gives insight into the original 
source of the virus,’ he says. 

The route taken by this coronavirus to 
infect humans is still not clear. It could, for 
example, be carried by intermediate animal 
hosts, or in foodstuffs contaminated by the 
faeces of bats or other infected animals. To 
answer that, says Lipkin, “one would need 
to work for months”. He hopes to send in 
another team to work alongside his Saudi 
colleagues in the near future. = 
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NIH tackles major 
workforce issues 


US agency aims to expand training options for graduate 
students and to increase demographic diversity. 


BY MEREDITH WADMAN 


he world’s largest biomedical research 
Tiina agency last week announced 

a programme of nudges, incentives 
and requirements intended to respond to two 
thorny workforce-related problems: the over- 
supply of young scientists who face diminish- 
ing prospects of landing academic jobs, and 
the lack of demographic diversity in the US 
biomedical workforce. 

Two reports issued in June challenged the 
US National Institutes of Health (NIH) to bet- 
ter prepare young biologists for an evolving 
and increasingly competitive job market, and 
to boost diversity (see Nature 486, 304; 2012). 
In response, the agency, based in Bethesda, 
Maryland, will spend an average of US$50 mil- 
lion a year for 10 years on a grant programme 
designed to provide financial support and 
mentoring to undergraduates at less-research- 
intensive institutions, which tend to have more 
diverse student populations. Another set of 
up to 50 grants, each worth about $250,000, 
will entice institutions to launch innovative 
training programmes that prepare students 
for careers outside academia, in everything 
from science policy to industry and research 
administration. The agency will also encour- 
age universities and other institutions to limit 
NIH-supported doctoral studies to five years. 

To measure the success of the new initiatives, 
the NIH will require universities and academic 
medical centres to track the careers of all of 
their NIH-funded graduate students and post- 
docs for 15 years after they are trained. “The 
challenges facing the biomedical workforce 
are complex and these proposed initiatives are 
designed to effect real change,” says Francis 
Collins, director of the NIH. “I’m optimistic 
that the entire scientific-research community 
will come together to help implement them” 

The National Medical Association (NMA) in 
Silver Spring, Maryland, the main group rep- 
resenting black medical scientists, praised the 
agency's diversity plans, which also include a 
nationwide mentoring consortium, training on 
diversity issues for NIH review staffand mem- 
bers of peer-review panels, and a pilot scheme 
to assess anonymized grant applications. These 
initiatives were spurred by a study showing 
that, when other factors such as education are 


controlled for, black applicants are 10% less 
likely than whites to win NIH grant funding 
(D. K. Ginther et al. Science 333, 1015-1019; 
2011). Rahn Bailey, president of the NMA, 
called the plans “a move in the right direction’. 

But Shirley Tilghman, who co-chaired the 
working group that wrote the June workforce 
report, said that simply encouraging institu- 
tions to limit the length of doctoral studies 
isn't enough. Tilgh- 
man, a molecular 
biologist and the 
president of Prince- 
ton University in New 
Jersey, told Collins's 
advisory committee 
on 6 December that 
“unless you get a stick, 
this won't happen”. 
Her group’s report 


m noted that the number 
4 These proposed  ofUSPhDbiomedical 
initiatives graduates who secure 
are designed tenure-track or ten- 
to effect real ured jobs has fallen 
change.” from 34% in 1993 to 


Francis Collins 26% today. She said 
that unless the NIH 
takes tougher steps, “we will be looking at data 
that look just like this ten more years from now”. 

Neither the June report nor the NIH response 
answer one key question — how many biomedi- 
cal PhDs the United States should be training 
— because the necessary data and workforce- 
modelling expertise were not available either 
to the report's authors or to NIH staff. Indeed, 
the NIH also plans to hire at least one labour 
economist, to work as part ofa new unit devoted 
to modelling biomedical-workforce needs. 

As part of its workforce initiative, the NIH 
will increase yearly stipends for the 35,000 
postdocs it supports from $39,000 to $42,000, 
a 7.7% rise, in an effort to reflect postdocs’ 
extensive training and to discourage lab heads 
from prolonging the postdoctoral training 
period. The National Postdoctoral Association 
in Washington DC applauded the NIH’s plans. 
“The NIH has really stepped up to the plate in 
regard to early-career scientists,” says Cathee 
Johnson Phillips, the group's executive direc- 
tor. The last time NIH-funded postdocs got a 
significant pay rise was in 2003. = 
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that he has discovered that partial genetic 
sequences of a virus isolated from bats 
match the coronavirus found in humans. 
“The finding gives insight into the original 
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The route taken by this coronavirus to 
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answer that, says Lipkin, “one would need 
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Cash injection set to 
revive Swiss drug site 


Merck Serono facility could become biotech research hub. 


BY DANIEL CRESSEY 


ne of the largest philanthropic 
( )icsssin in European research 

history could give some former phar- 
maceutical-company laboratories fresh life as 
a centre for bioengineering. 

In April, German pharmaceutical com- 
pany Merck KGaA announced that it was 
closing the headquarters of Merck Serono, 
its drug-development unit based in Geneva, 
Switzerland, as part ofa restructuring effort 
that cut about 500 jobs. 

But last week, Swiss billionaire business- 
men Hansjorg Wyss and Ernesto Bertarelli 
rode to the rescue with an undisclosed offer 
to buy the 45,000-square-metre site close to 
the shores of Lake Geneva. They plan to give 
the laboratory facilities to the University of 
Geneva and the Swiss Federal Institute of 
Technology in Lausanne (EPFL), under the 
umbrella of a new Wyss Institute, and would 
fund research at the institute with a donation 
of 125 million Swiss francs (US$133.9 mil- 
lion) over six years. 

“We've seen a lot of these gifts going to the 
United States,” says Patrick Aebischer, presi- 
dent of the EPFL, but donations of this size are 
vanishingly rare in Europe. A 2011 report from 
the European Commission, for example, noted 
that “philanthropic fundraising is not, on the 
whole, taken seriously in European universi- 
ties” (B. Breeze et al. Giving In Evidence Euro- 
pean Commission; 2011). 

The Swiss Wyss Institute would house 
about ten life-science laboratories, led by new 
research chairs affiliated to the universities, 
and would provide jobs for up to 150 people. 
Merck Serono’s state-of-the-art laboratories 
would not need to be refitted, says Jean- 
Dominique Vassalli, rector of the Univer- 
sity of Geneva, so “I would guess within six 
months or a year we can have people moving 
in and working there. That’s an extraordinary 
opportunity.” The labs would focus on areas of 
overlap between medical research and engi- 
neering, such as tissue regeneration, implants 
and transplants. 

Start-ups and other 


biotechnology busi- DNATURE.COM 
nesses could also be Read more about 
hosted on the site, which __ philanthropy and 
is close to many existing science: 
companies. Wyss and _ go.nature.com/hig27t 


Bertarelli were unavailable for interviews, but 
a spokeswoman for the project says that they 
have not finalized their plans for the remainder 
of the site. A property consultancy is reviewing 
the proposal on Merck KGaA’s behalf. 

The move is the latest in a string of projects 
to transform former pharmaceutical-company 
sites into academic-industrial research hubs. 
In 2007, Yale University in New Haven, Con- 
necticut, paid $109 million for a local site 
vacated by Bayer. Two years later, the Univer- 
sity of Michigan in Ann Arbor handed over 
$108 million for a 28-building former Pfizer 
site, which the university turned into the North 
Campus Research Complex. And since Pfizer 
announced that it was pulling out of a large 
site in Sandwich, UK, last year (see Nature 470, 
154; 2011), the British government has been 
trying to foster a science park there. 

Both of the Swiss 


“Within a year philanthropists 
we can have certainly have the 
people moving experience and the 
in and working resources to make 
there. That’san their plan work. Ber- 
extraordinary tarelli’s grandfather 


founded Serono, and 
Bertarelli was chief 
executive of the com- 
pany before selling it to Merck KGaA in 2007. 
And in 2009, Wyss made a similar endowment 
to create the Wyss Institute for Biologically 
Inspired Engineering at Harvard University 
in Boston, Massachusetts. 

Donald Ingber, director of the Harvard 
institute, says that it brings together former 
industry researchers, who have product- 
development experience, with scientists work- 
ing in basic biology. “What we've created is 
a start-up culture within an academic envi- 
ronment,” he says. Although it is not yet clear 
whether a future Swiss Wyss centre would 
follow the same model, Ingber says, “I could 
see potentially great things coming out of that 
place” = 


opportunity.” 


CORRECTION 

The source for the graphic ‘Urban emitters’ 
in the News story ‘Megacities move to track 
emissions’ (Nature 492, 20-21; 2012) 
should have been P. Romero Lankao et al. 
Clim. Res. 38, 17-29 (2008). 
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Submersible pilots from the University of Ghent in Belgium find invading crustaceans at Antarctica’s Palmer Deep in 2010. 


TROUBLE BARES ITS CLAWS 


Crabs invading 
the Antarctic 
continental shelf 
could deal a 
crushing blow to 
a rare ecosystem. 


BY DOUGLAS FOX 


na dim February evening, seven 

people crowded around a row of 

television monitors in a shack on 

the rear deck of the RV Nathaniel 
B. Palmer. The research icebreaker was idling 
30 kilometres off the coast of Antarctica with 
a cable as thick as an adult's wrist dangling 
over the stern. At the end of that cable, on the 
continental shelf 1,400 metres down, a remote- 
operated vehicle (ROV) skimmed across the 
sea floor, surveying a barren, grey mudscape. 
The eerie picture of desolation, piped back to 
the television monitors, was the precursor to 
an unwelcome discovery. 

The ROV had visited 11 different sea- 
floor locations during this 57-day research 
cruise along the Antarctic Peninsula in 2010. 
Each time, it had found plenty of life, mostly 
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invertebrates: sea lilies waving in the currents; 
brittlestars with their skinny, sawtoothed arms; 
and sea pigs, a type of sea cucumber that lum- 
bers along the sea floor on water-inflated legs. 
But at this spot, they were all absent. After 
15 minutes, the reason became clear: a red- 
shelled crab, spidery and with a leg-span as 
wide as a chessboard, scuttled into view of the 
ROV’s cameras. It probed the mud methodi- 
cally — right claw, left claw, right claw — look- 
ing for worms or shellfish. Another crab soon 
appeared, followed by another and another. 
The crowded shack erupted into chatter. 
“They're natural invaders,’ murmured Craig 
Smith, a marine ecologist from the University 
of Hawaii at Manoa. “They're coming in with 
the warmer water.” 

Cold temperatures have kept crabs out of 


Antarctic seas for 30 million years. But warm 
water from the ocean depths is now intrud- 
ing onto the continental shelf, and seems to be 
changing the delicate ecological balance. An 
analysis’ by Smith and his colleagues suggests 
that 1.5 million crabs already inhabit Palmer 
Deep, the sea-floor valley that the ROV was 
exploring that night (see ‘A warming wel- 
come’). And native organisms have few ways 
of defending themselves. “There are no hard- 
shell-crushing predators in Antarctica,” says 
Smith. “When these come in they’re going to 
wipe out a whole bunch of endemic species.” 

Researchers are worried that rising crab 
populations and other effects of the warm- 
ing waters could irrevocably change a sea- 
floor ecosystem that resembles no other on 
Earth. Scientists are racing to document these 
effects, even as they continue to explore this 
little-understood region. “This could have 
a really major reorganizing impact on these 
unique and endemic marine communities,” 
says Richard Aronson, a marine biologist at the 
Florida Institute of Technology in Melbourne, 
who was part of a team that found crabs on 
another part of Antarctica’s continental shelf 
in December 2010 (ref. 2). “It’s a fascinating 
thing?’ he says. “A little scary, because it’s a very 
obvious footprint of climate change.” 


CUT OFF BY COLD 
Aronson has worried about the fragility of life 
on the Antarctic shelf for more than a decade. 
He spent December 1994 collecting fossils from 
Seymour Island, on the northeast fringe of the 
Antarctic Peninsula. The island’s bare, crum- 
bling hills contain the remnants of an ancient 
sea floor. In 200 metres of layered rock and fos- 
sils exposed by wind erosion, Aronson saw evi- 
dence of the most pivotal event in Antarctica’s 
history: the continent's final separation from 
South America, starting around 40 million 
years ago. This event allowed the emergence of 
the circumpolar ocean current, which isolated 
Antarctica from warmer air and water masses 
farther north, and plunged it into perpetual 
winter. Aronson and his students analysed 
10,000 fossils from before and after that sudden 
cooling, and a striking pattern emerged. 
As temperatures fell, the sea floor bloomed 
with soft-bodied echinoderms — invertebrates 
including starfish, brittlestars, sea lilies and sea 
cucumbers. At the same time, crush wounds 
caused by crabs or sharks on the arms of fossil 
starfish and sea lilies became rare — evidence 
that these predators were declining’. 
Crabs and lobsters were probably 
excluded by a physiological quirk. At tem- 
peratures below about 1 °C, they become 
unable to regulate magnesium in body 
fluids, leading to narcosis, clumsiness 
and paralysis of breathing. Most of the 
100 or so fish species currently found on 
the Antarctic shelf belong to a single sub- 
order, whose members evolved antifreeze 
proteins to keep their blood flowing at 


subzero temperatures and then diversified to 
fill most niches in the frigid seas. They lack 
powerful jaws’. 

The result is an ecosystem reminiscent of 
that 350 million years ago, in which the top 
predators are slow-moving invertebrates such 
as starfish, sea spiders and ribbon worms’. “All 
of this stuff has got a very Palaeozoic flavour 
to it? says Aronson. The relaxation of natural 
selection allowed species to lose their natural 
armour, says James McClintock, a marine biolo- 
gist at the University of Alabama at Birming- 
ham. Animals on the Antarctic sea floor “are 
very weakly skeletonized’, he says. “You can pick 
up an Antarctic clam and crush it in your hand” 

By the mid 2000s, Aronson began to believe 
that if Antarctica’s oceans warmed up, the eco- 
logical cascade that caused this blast-from-the- 
past ecosystem to flourish would run in reverse: 
crushing predators would return and wreak 
havoc’. That prediction is now being tested. 

Westerly winds are strengthening and the 
circumpolar current is intensifying, driven by 
atmospheric warming and a hole in the ozone 
layer over Antarctica. These changes are lift- 
ing warm, dense, salty water from 4,000 metres 
down in the Southern Ocean up over the lip of 
the continental shelf. 

Douglas Martinson, an oceanographer at the 
Lamont-Doherty Earth Observatory in Pali- 
sades, New York, has documented this process 
on the western side of the Antarctic Peninsula, 
where crabs are invading. Martinson installed 
five temperature and current sensors around 
Marguerite Trough — a deep canyon carved 


A WARMING WELCOME 


The circumpolar current is kicking 
salty warm water up onto 
Antarctica’s continental shelf, 
allowing crabs to invade. 
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into the sea floor by glaciers advancing to the 
edge of the continental shelf in past ice ages. 
The moorings captured an insidious process: 
as the circumpolar current skirts Antarctica’s 
continental shelf, it runs head-on into the steep 
wall of the trough. About once a week, a swirl- 
ing eddy containing 100 cubic kilometres of 
warm water wafts up from that collision, spill- 
ing onto the continental shelf**. The same 
thing happens elsewhere, says Martinson: “It 
looks like this is what happens at all of the can- 
yons that cut across the shelf” 

The temperature of this intruding water is 
only about 1.8 °C — but for an ocean region 
generally between 1 and -2°C, the impact is 
substantial. And the incursion seems to have 
begun only recently, says Eugene Domack, a 
marine geologist at Hamilton College in Clin- 
ton, New York, who led the 2010 cruise to 
Palmer Deep. 

Domack has managed to date’ the onset by 
measuring the amount of radioactive carbon-14 
in deep-sea corals collected from the continen- 
tal shelf — a process similar to reading tree 
rings. The corals had grown for 400 years before 
being dredged up. The carbon-14 content 
increased smoothly along the coral’s growth 
axis for the first 350 years, and then dropped 
suddenly — indicating that the coral was being 
bathed in water with a reduced carbon-14 
content. The water from the depths of the cir- 
cumpolar current would fit the bill: it has been 
isolated from the carbon in the atmosphere for 
centuries. On the basis of these measurements, 
Domack has deduced that the warm-water 
incursion “kicked in somewhere around the 
turn of the last century, 1920 or 1930”. 
According to Domack’s results, the incur- 
sion seems to have started following the end 

of the Little Ice Age — a period of relative 
cold that began in the Middle Ages — 
but it has intensified as anthropogenic 
warming and southern ozone depletion 
have taken hold. Average water temper- 
atures west of the Antarctic Peninsula 
have risen by 1 °C in the past 50 years, and 
continue to rise by 0.01-0.02°C per year’. 
“The heat injection is going through the roof? 
says Martinson. “It’s going up exponentially.” 


INVASIVE SPECIES 

The first evidence that crabs were poised to 
invade along with the warm water came early 
in 2007. Sven Thatje, a marine ecologist at the 
University of Southampton, UK, launched an 
ROV to the outer slope of the Antarctic Pen- 
insula to map glacial grooves on the sea floor. 
But its cameras also caught sight of 13 king 
crabs (Paralomis birsteini) between depths 
of 1,300 and 1,100 metres (ref. 8). Thatje had 
studied the cold tolerance of these crabs and 
concluded that they could probably survive 
farther north at 2,000—4,000 metres, where the 
water is a degree or two warmer — “but then we 
found them even on the continental slope” only 
500 metres below the shelf itself, he says. “These 
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King crabs (Neolithodes yaldwyni) are invading Antarctic seas, where they prey on local species. 


crabs were thriving at 1°C. They were basically 
at the physiological limit that I had anticipated” 

But it was Smith's discovery of Neolithodes 
yaldwyni king crabs in Palmer Deep, 120 kil- 
ometres in from the edge of the continental 
shelf, that demonstrated a true invasion. West 
of the Antarctic Peninsula, cold water sits on 
top of warmer water. To reach Palmer Deep 
from the outer ocean, crabs or larvae must 
have crossed what amounts to a cold, high 
mountain pass only 450 metres below sea level 
before settling into Palmer Deep, at depths of 
800-1,400 metres (ref. 1). 

In December 2010, Thatje, working with 
Aronson and McClintock, returned to Antarc- 
tica and towed a submersible up and down the 
continental slope near the mouth of Margue- 
rite Trough. The ROV traced 100 kilometres of 
sea floor, capturing 150,000 photographs that 
revealed hundreds of P. birsteini crabs between 
2,300 and 830 metres down’. “If you extrapo- 
late,” says McClintock, “we're talking about 
millions of crabs.” 

The crab invasion could have started a 
decade or two ago. When Smith re-examined 
photographs taken at the bottom of Palmer 
Deep in 1998, he saw telltale claw marks in 
the mud — indicating that at least some crabs 
were already present, even if none had been 
caught on camera’. Domack looked at 30 years 
of water-temperature data measured at Palmer 
Deep during earlier cruises, and found that 
the sea-floor valley had gradually warmed’ — 
becoming ever-more 


hospitable to the crabs. NATURE.COM 
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from deeper, warmer waters farther north in 
the Southern Ocean. The data from these 
experiments should help him to zero in on the 
crabs origins and the date of their arrival. 

But even without knowing the exact history 
of the invasion, the implications seem clear. 
Animals living on the edge of their physi- 
ological limits often struggle to survive and 
reproduce, but 19 out of 27 crabs that Smith 
collected during a cruise in 2011 turned out to 
be females carrying larvae or eggs. “This popu- 
lation is reproducing like crazy,’ he says. “It’s 
probably here to stay and expand.” As the ceil- 
ing of cold water continues to lift over the next 
10 or 20 years, crabs could spill out of Palmer 
Deep and Marguerite Trough — and colo- 
nize the broader continental shelf at depths 
of 400-600 metres, devastating the endemic 
sea life. 


STIFLING HEAT 

The warm waters will also bring other perils for 
Antarctica’s sea-floor gardens. Many of the spe- 
cies here are exquisitely sensitive to increases in 
temperature. The brittlestars and other inverte- 
brates have extremely slow metabolisms — an 
adaptation to the cold water — and only mea- 
gre ability to absorb and transport oxygen. “So 
what do those guys do if it warms up and their 
metabolic rate speeds up?” asks Lloyd Peck, 
a biologist at the British Antarctic Survey in 
Cambridge, who has monitored these creatures 
in aquarium warming experiments. Their oxy- 
gen demand revs beyond what their gills can 
supply — and they slowly suffocate”"”. 

About half of the two dozen Antarctic species 
that Peck has studied seem to do fine in water 
2°C warmer than current maximum summer 
temperatures — but the rest seem to suffer. 
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“At least two of the species that we think are 
going to be the first [to disappear] could give 
problems for the balance of the ecosystem,’ he 
says. Those are the Antarctic clam (Laternula 
elliptica)’ and the shallow-water brittlestar 
(Ophionotus victoriae)'°, both mainstay spe- 
cies that eat dead plankton and other organic 
trash that falls from above, and turn it into the 
biomass that feeds everything else on the sea 
floor. But these species either die or become 
dangerously sluggish at even 1 °C above cur- 
rent summer highs — temperatures that could 
become widespread in 50-100 years. 

If rising temperatures cause brittlestars and 
clams to disappear, then more falling detritus 
might be consumed by microbes instead of 
being converted into edible biomass — mean- 
ing that it would sustain fewer animals, over- 
all, on the sea floor. Alternatively, filter-feeding 
sponges might multiply to fill the niche. Either 
way, the mix of species supported further up 
the food chain might no longer include large 
numbers of archaic predators such as starfish, 
ribbon worms and sea spiders. 

Julian Gutt, a marine ecologist at the Alfred 
Wegener Institute for Polar and Marine 
Research in Bremerhaven, Germany, admires 
Smith’s work with the crabs, but withholds 
final judgement on whether the crustaceans 
are a new piece of this destructive puzzle, or 
along-present fixture. Repeat surveys show- 
ing that the crabs are expanding their foothold 
over time would confirm an invasion, he says. 
But “if they move into new habitat, some seri- 
ous impact is quite likely”. 

Aronson, for one, will be watching closely for 
signs that this is happening. And in his experi- 
ence, optimism is not warranted. “Every time 
we make a prediction of what we think will 
happen in the next 50 years, then poof, 10 years 
later, there it is, he says. “So I think this is going 
to be happening more rapidly than, as conserv- 
ative scientists, we're used to predicting.” = 


Douglas Fox is a freelance writer in the San 
Francisco Bay area, California. 
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laptop computer can double as an effective port- 
able knee-warmer — pleasant in a cold office. But a 
bigger desktop machine needs a fan. A data centre as 
large as those used by Google needs a high-volume 
flow of cooling water. And with cutting-edge super- 
computers, the trick is to keep them from melting. A 
world-class machine at the Leibniz Supercomputing 
Centre in Munich, for example, operates at 3 petaflops 
(3 x 10° operations per second), and the heat it produces warms some 
of the centre’s buildings. Current trends suggest that the next milestone 
in computing — an exaflop machine performing at 10"* flops — would 
consume hundreds of megawatts of power (equivalent to the output 
of a small nuclear plant) and turn virtually all of that energy into heat. 

Increasingly, heat looms as the single largest obstacle to computing’s 
continued advancement’. The problem is fundamental: the smaller and 
more densely packed circuits become, the hotter they get. “The heat 
flux generated by today’s microprocessors is loosely comparable to that 
on the Sun’s surface,” says Suresh Garimella, a specialist in computer- 
energy management at Purdue University in West Lafayette, Indiana. 
“But unlike the Sun, the devices must be cooled to temperatures lower 
than 100°C” to function properly, he says. 

To achieve that ever more difficult goal, engineers are exploring new 
ways of cooling — by pumping liquid coolants directly on to chips, for 
example, rather than circulating air around them. In a more radical 
vein, researchers are also seeking to reduce heat flux by exploring ways 
to package the circuitry. Instead of being confined to two-dimensional 
(2D) slabs, for example, circuits might be arrayed in 3D grids and net- 
works inspired by the architecture of the 
brain, which manages to carry out massive 
computations without any special cooling 
gear. Perhaps future supercomputers will 
not even be powered by electrical currents 
borne along metal wires, but driven elec- 
trochemically by ions in the coolant flow. 14 

This is not the most glamorous work in 
computing — certainly not compared to 12 
much-publicized efforts to make electronic 
devices ever smaller and faster. But those 
high-profile innovations will count for little 
unless engineers crack the problem of heat. 


quickly driven it up again. 
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GO WITH THE FLOW 
The problem is as old as computers. The 
first modern electronic computer — a 
30-tonne machine called ENIAC that was 
built at the University of Pennsylvania in 
Philadelphia at the end of the Second World 
War — used 18,000 vacuum tubes, which 
had to be cooled by an array of fans. The 
transition to solid-state silicon devices in 
the 1960s offered some respite, but the need 
for cooling returned as device densities climbed. In the early 1990s, 
a shift from earlier ‘bipolar’ transistor technology to complementary 
metal oxide semiconductor (CMOS) devices offered another respite by 
greatly reducing the power dissipation per device. But chip-level com- 
puting power doubles roughly every 18 months, as famously described 
by Moore's Law, and this exponential growth has brought the problem 
to the fore yet again’ (see ‘Rising temperatures’). Some of today’s micro- 
processors pump out heat from more than one billion transistors. If 
a typical desktop machine let its chips simply radiate their heat into 
a vacuum, its interior would reach several thousand degrees Celsius. 

That is why desktop computers (and some laptops) have fans. Air that 
has been warmed by the chips carries some heat away by convection, 
but not enough: the fan circulates enough air to keep temperatures at a 
workable 75°C or so. 

But a fan also consumes power — for a laptop, that is an extra drain on 
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RISING TEMPERATURES 


The switch from bipolar to complementary metal oxide 
semiconductor (CMOS) transistors lowered microchips’ 
heat output — but increasing device densities have 
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the battery. And fans alone are not always sufficient to cool the computer 
arrays used in data centres, many of which rely on heat exchangers that 
use liquid to cool the air flowing over the hot chips. 

Still larger machines demand more drastic measures. As Bruno 
Michel, manager of the advanced thermal packaging group at IBM in 
Riischlikon, Switzerland, explains: “An advanced supercomputer would 
need a few cubic kilometres of air for cooling per day.’ That simply is not 
practical, so computer engineers must resort to liquid cooling instead’. 

Water-cooled computers were commercially available as early as 1964, 
and several generations of mainframe computers built in the 1980s and 
1990s were cooled by water. Today, non-aqueous, non-reactive liquid 
coolants such as fluorocarbons are sometimes used, often coming 
into direct contact with the chips. These substances generally cool by 
boiling — they absorb heat and the vapour carries it away. Other systems 
involve liquid sprays or refrigeration of the circuitry. 

SuperMUC, an IBM-built supercomputer housed at the Leibniz 
centre, became operational in 2012. The 3-petaflop machine is one of 
the world’s most powerful supercomputers. It has a water-based cooling 
system, but the water is warm — around 45°C. The water is pumped 
through microchannels carved into a customized copper heat sink 
above the central processing unit, which concentrates cooling in the 
parts of the system where it is most needed. The use of warm water may 
seem odd, but it consumes less energy than other cooling methods, 
because the hot water that emerges from the system requires less chilling 
before it is reintroduced. And the use of hot-water outflow for heating 
nearby office buildings results in further energy savings. 

Michel and his colleagues at IBM believe that flowing water could 
be used not just to extract heat, but also 
to provide power for the circuitry in the 
first place, by carrying dissolved ions that 
engage in electrochemical reactions at 
energy-harvesting electrodes. In effect, 
the coolant doubles as an electrolyte ‘fuel. 
The idea is not entirely new, says Yogendra 
Joshi, a mechanical engineer at the Georgia 
Institute of Technology in Atlanta. “It has 
been used for many years in thermal man- 
agement of aircraft electronics’, which are 
cooled by jet fuel, he says. 

Delivering electrical power with an 
electrolyte flow is already a burgeoning 
technology. In a type of fuel cell known as a 
redox flow battery, for example, two electro- 
lyte solutions are pumped into an electro- 
chemical cell, where they are kept separate 
by a membrane that ions can flow through. 
Electrons travel between ions in the solu- 
tions in a process known as a reduction- 
oxidation (redox) reaction — but they are 
forced to do so through an external circuit, 
generating energy that can be tapped to 


1990 2000 2010 


provide electrical power. 


SALTY LOGIC 
Redox-flow cells can be miniaturized using microfluidic technology, in 
which the fluid flows are confined to microscopic channels etched into 
a substrate such as silicon’. At such small scales, the liquids can flow 
past each other without mixing, so there is no need for a membrane 
to separate them. With this simplification, the devices are easier and 
cheaper to make, and they are compatible with silicon-chip technology. 
Michel and his colleagues have begun to develop microfluidic 
cells for powering microprocessors, using a redox process based on 
vanadium ions. The electrolyte is pumped along microchannels that are 
100-200 micrometres wide and similar to those used to carry coolant 
flows around some chips. Power is harvested at electrodes spaced along 
the channel, then distributed to individual devices by conventional 
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Steam carries heat away from Google's data centre in The Dalles, Oregon. 


metal wiring. The researchers unveiled their preliminary results in 
August, at a meeting of the International Society of Electrochemistry 
in Prague’. 

But they remain some way from actually powering circuits this way. 
At present, the power density of microfluidic redox-flow cells is less than 
1 watt per square centimetre at 1 volt — two or three orders of magni- 
tude too low to drive today’s microprocessors. However, Michel believes 
that future processors will have significantly lower power requirements. 
And, he says, delivering power with microfluidic electrochemical cells 
should at least halve the power losses that occur with conventional metal 
wiring, which squanders around 50% of the electrical energy it carries 
as resistive heating. 


BECOMING BRAINIER 

Electrochemical powering could help to reduce processors’ heat 
dissipation, but there is a way to make a much bigger difference. Most 
of the heat from a chip is generated not by the switching of transis- 
tors, but by resistance in the wires that carry signals between them. 
The problem is not the logic, then, but the legwork. During the late 
1990s, when transistors were about 250 nanometres across, ‘logic’ and 
‘legwork accounted for roughly equal amounts of dissipation. But 
today, says Michel, “wire energy losses are now more than ten times 
larger than the transistor-switching energy losses”. In fact, he says, 
“because all components have to stay active while waiting for infor- 
mation to arrive, transport-induced power loss accounts for as much 
as 99% of the total”. 

This is why “the industry is moving away from traditional chip archi- 
tectures, where communication losses drastically hinder performance 
and efficiency’, says Garimella. The solution seems obvious: reduce the 
distance over which information-carrying pulses of electricity must 
travel between logic operations. Transistors are already packed onto 
2D chips about as densely as they can be. If they were stacked in 3D 
arrays instead, the energy lost in data transport could be cut drastically. 
The transport would also be faster. “If you reduce the linear dimension 
by a factor of ten, you save that much in wire-related energy, and your 
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information arrives almost ten times faster,” says Michel. He foresees 
3D supercomputers as small as sugar lumps. 

What might 3D packaging look like? “We have to look for examples 
with better communication architecture,” Michel says. “The human 
brain is such an example.’ The brain’s task is demanding: on average, 
neural tissue consumes roughly ten times more power per unit volume 
than other human tissues — an energy appetite unmatched even in an 
Olympic runner’s quadriceps. The brain accounts for just 2% of the 
body’s volume, but 20% of its total energy demand. 

Yet the brain is fantastically efficient compared to electronic comput- 
ers. It can achieve five or six orders of magnitude more computation 
for each joule of energy consumed. Michel is convinced that the brain’s 
efficiency is partly due to its architecture: it is a 3D, hierarchical network 
of interconnections, nota grid-like arrangement of circuits. 


SMART BUILD 

This helps the brain to make much more efficient use of space. In a 
computer, as much as 96% of the machine’s volume is used to transport 
heat, 1% is used for communication (transporting information) and 
just one-millionth of one per cent is used for transistors and other logic 
devices. By contrast, the brain uses only 10% of its volume for energy 
supply and thermal transport, 70% for communication and 20% for com- 
putation. Moreover, the brain's memory and computational modules are 
positioned close together, so that data stored long ago can be recalled in an 
instant. In computers, by contrast, the two elements are usually separate. 
“Computers will continue to be poor at fast recall unless architectures 
become more memory-centric’, says Michel. Three-dimensional pack- 
aging would bring the respective elements into much closer proximity. 

All of this suggests to Michel that, if computers are going to be 
packaged three-dimensionally, it would be worthwhile to try to emulate 
the brain's hierarchical architecture®. Such a hierarchy is already implicit 
in some proposed 3D designs: stacks of individual microprocessor chips 
(on which the transistors themselves could be wired in a branching 
network) are stacked into towers and interconnected on circuit boards, 
and these, in turn, are stacked together, enabling vertical communica- 
tion between them. The result is a kind of ‘orderly fractal’ structure, a 
regular subdivision of space that looks the same at every scale. 

Michel estimates that 3D packaging could, in principle, reduce 
computer volume by a factor of 1,000, and power consumption by a 
factor of 100, compared to current 2D architectures. But the introduc- 
tion of brain-like, ‘bionic packaging structures, he says, could cut power 
needs by another factor of 30 or so, and volumes by another factor of 
1,000. The heat output would also drop: 1-petaflop computers, which 
are now large enough to occupy a small warehouse, could be shrunk to 
a volume of 10 litres. 

If computer engineers aspire to the awesome heights of zetaflop 
computing (10” flops), a brain-like structure will be necessary: with 
today’s architectures, such a device would be larger than Mount Everest 
and consume more power than the current total global demand. Only 
with a method such as bionic packaging does zetaflop computing 
seem remotely feasible. Michel and his colleagues believe that such 
innovations should enable computers to reach the efficiency — if not 
necessarily the capability — of the human brain by around 2060. That 
is something to think about. = 


Philip Ball is a writer based in London. 
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In John Cooke’s painting Discussion on the Piltdown Skull (1915), scientists examine a human-like cranium found in Piltdown, England, in 1912. 


The 100-year mystery 
of Piltdown Man 


Chris Stringer explains why the longest-running whodunnit 
in palaeontology is still worth solving. 


h | ine years ago, the Natural History 
Museum in London, where I work, 
celebrated the 50th anniversary of 

the disclosure of one of the most successful 

scientific hoaxes in history. In 1953, scien- 
tists from the museum and the University 
of Oxford, UK, showed that specimens 
unearthed in the early twentieth century 
from the village of Piltdown in Sussex, UK, 
were an elaborate forgery — not evidence of 
a primitive human, as their discoverers had 


claimed in December 1912. 

During our celebrations, a jaded 
colleague said: “Id like to see the anniver- 
sary commemorated by the crushing of all 
the material and the burning of the Piltdown 
archive!” Yet, almost a decade later, I find 
myself part ofa growing team of investigators 
hoping to reveal how the forgery was accom- 
plished, and so to work out, finally, who did it 
and what drove them. 

Why — 100 years after ‘Piltdown Man’ 


was presented to the Geological Society of 
London — does it matter? 

Personally, I am intrigued by the question 
of whether the hoax was driven by scientific 
ambition or by more jocular or vindictive 
motives. Also, the story has continuing 
relevance beyond its allure as a whodunnit: 
it is a warning to scientists to keep their 
critical guard up, and an example of the 
(eventual) triumph of the scientific method. 

In 1891, a Dutch palaeoanthropologist 
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Clockwise from top: Charles Dawson (sitting) and Arthur Smith Woodward (right) at the Piltdown site 
around 1913; a local pub’s homage to Piltdown Man; the forgery’s extent is revealed in London, 1954; 
Kenneth Oakley (left) and L. E. Parsons examine Piltdown jaw, 1949; Eoanthropus skull models, 1913. 
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described the first known specimens of 
Homo erectus (‘Java Mar’), which he had 
discovered in Indonesia. Sixteen years later, 
a workman in Germany unearthed the jaw 
of ‘Heidelberg Man; a possible descendant 
of H. erectus. Thus it was with great pride 
that Arthur Smith Woodward and Charles 
Dawson announced on 18 December 1912 
that they had found an even more remark- 
able fossil human at Piltdown'. 

Woodward was a distinguished British 
palaeontologist and ‘Keeper of Geology’ 
at what was then the British Museum of 
Natural History; Dawson was an ama- 
teur antiquarian. What Woodward called 
Eoanthropus dawsoni (‘Dawson’s dawn 
man’) consisted of an ape-like mandible 
containing two molar teeth and parts of 
a human-like cranium. Woodward and 
Dawson had also unearthed primitive stone 
tools and fragments of fossilized mammals, 
including a hippopotamus and an elephant- 
like creature — all stained the same dark 
brown as the gravels in the pit where they 
were found. The finds led Woodward and 
Dawson to assert that Eoanthropus was 
potentially as ancient as Java Man — now 
known to be about one million years old. 

Excavations led by the pair over the next 
two years recovered more artefacts and fauna 
from the Piltdown site, including a canine 
tooth. Even a slab of elephant bone was 
dug up which, because of its shape, became 
known as the ‘cricket bat: Further work was 
disrupted by the onset of the First World War 
and Dawson's declining health. But before 
Dawson died in 1916, he wrote to Woodward 
saying that he had found further remains of 
fauna and Eoanthropus at a second site, a few 
kilometres from the original gravel pit. 


A COMPLEX FORGERY 
The discoveries made headlines throughout 
the world. But in palaeontological circles, 
reactions were mixed from the beginning. 
Several prominent British scientists fully 
accepted the existence of Eoanthropus. A 
minority joined some US and German 
researchers in questioning the association of 
the jaw and skull parts. They believed that an 
ancient ape fossil could have become mixed 
into deposits containing fragments of a more 
recent human skull. Some dissenters were 
won over by the tooth and skull fragments 
Dawson found at the second Piltdown loca- 
tion, which matched those from the first site. 
Through the 1920s and 1930s, Eoanthropus 
was increasingly marginalized as other 
ancient humans were discovered in Africa, 
China and Indonesia. None of these fossils 
showed Eoanthropus’ strange combination 
of ape-like jaw and human-like braincase. 
By 1950, things had become even worse 
for Eoanthropus. Kenneth Oakley, a geo- 
archaeologist at the museum, applied 
chemical tests to the relics and determined 
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WHO DID IT? 


Four main suspects in the Piltdown case 


Teilhard 

de Chardin 
Priest who 
discovered 
canine tooth 
at Piltdown. 


Martin 
Hinton 
Woodward's 
subordinate; 
later, ‘Keeper 
of Zoology’. 


that the jaw could not be more than 50,000 
years old, even though some of the Piltdown 
fauna was clearly much older. More compre- 
hensive studies by Oakley and his colleagues, 
published in 1953 and 1955, revealed the full 
extent of the forgery™”. 

A jaw and canine, probably from a 
modern orangutan, had been manipulated 
and stained, along with parts of a modern 
human skull. The fossil fauna had been 
gathered from many different places before 
being planted in the Piltdown gravels, and 
the stone tools had been similarly intro- 
duced after being stained to match the 
colour of the deposits. As for the ‘cricket bat’ 
— it had been carved with a steel knife from 
a fossilized elephant femur. 

Once the truth was out and national 
embarrassment overcome, the hunt was on 
for the culprit(s) in a palaeontological mys- 
tery that persists to this day (see ‘Who did 
it?’). Dawson was and is a prime suspect*®. 
Woodward and at least 12 others have also 
been accused, singly or in combination 
(see go.nature.com/x8uboq). 

My colleagues and I (at present, we are 
a team of 15 researchers from the Natural 
History Museum and several British uni- 
versities) are now examining the finds at a 
microscopic level. Using techniques such as 
radiocarbon dating and DNA and isotope 
studies, we hope to pin down the taxonomic 
identities and geographical origins of the 
specimens. We are also using spectroscopy 
to establish how many different stain- 
ing methods were applied to the Piltdown 
assemblages of bones, teeth and tools. 

If the materials collected from the two sites 


a 


Arthur Smith 
Woodward 

| Museum head 

of geology and 

co-leader of 

Piltdown finds. 


Charles 
Dawson 
Ambitious 

fossil collector 
and co-leader 

of Piltdown finds. 


match up, the culprit was likely to have been 
Dawson, as he was the sole ‘discoverer’ of the 
remains from the second site. If the canine 
turns out to have a different origin (and 
to have been stained differently) from the 
other samples, then its discoverer, Teilhard 
de Chardin, may have been involved in the 
scam’. De Chardin, then a novice Jesuit priest 
(later, a famous thinker and palaeontologist), 
helped Dawson in his work at the site. 

Alternatively, our results could strengthen 
or refute another recently proposed scenario 
involving zoologist Martin Hinton’. At the 
time of the Piltdown discoveries, Hinton 
was a volunteer in Woodward's department 
at the museum. During the 1970s, more than 
a dozen modified and stained bones and 
teeth were found among Hinton’s personal 
possessions (he died in 1961). 


MIXED BLESSING 

Identifying who doctored and planted the 
Piltdown assemblage is the key to under- 
standing why they did so. If it turns out that 
Dawson was to blame, for instance, scientific 
recognition was almost certainly the driving 
force. Dawson had ambitions to become a 
Fellow of the Royal Society, and he clearly 
liked to cultivate relationships with impor- 
tant people. 

It is unclear why Hinton would have 
played trickster. One possibility is that he 
surmised that something was amiss with 
the Piltdown findings but felt unable to 
raise his concerns to his department head 
(Woodward). Or perhaps he planted the 
‘cricket bat’ (seemingly a far less sophisti- 
cated forgery than the other artefacts) as a 


COMMENT 
way to alert his colleagues’. 


Regardless of who was responsible, the 
Piltdown hoax is a stark reminder to scien- 
tists that if something seems too good to be 
true, then perhaps it is. The hoax is unprec- 
edented in its complexity (some of the 
Piltdown finds were more expertly modified 
than others, and the skill of the forger(s) is 
one of the aspects we are examining). But 
it is not the only example of trickery within 
palaeontological and archaeological circles. 
Indeed, what happened at Piltdown may 
well have sped the revelation, in 2000, that 
the famous Japanese archaeologist Shinichi 
Fujimura was burying stone tools he had 
collected in previous digs to later unearth 
them as fresh finds. 

Piltdown Man’s temporary ascendancy 
undoubtedly cost the field. For instance, 
it delayed widespread acceptance of 
Australopithecus africanus, an early hominin 
found in South Africa in the mid-1920s, as a 
genuinely ancient relic of human evolution. 

Less obviously, Piltdown Man 
demonstrates the power of the 
scientific method to expose the truth, 
eventually (see go.nature.com/tbb9ma). 
The discoveries at Piltdown were steadily 
undermined by finds made elsewhere, 
even though the reasons for the specimens’ 
failure to withstand scrutiny were not 

fully articulated 


“Teshowshow until 1953. Indeed, 
far the field Piltdown Mans days 
of palaeo- were numbered once 
anthropology radiocarbon dating 
loneunihin began to be routinely 
the past 100 applied to the fossil 

4 record, from about 
years. 1959, 


That the story of 
Piltdown Man still resonates today is under- 
standable: asa ‘real-life whodunnit, itis hard 
to beat. Ultimately, however, it shows how far 
the field of palaeoanthropology has come in 
the past 100 years. m 


Chris Stringer is Research Leader in 
Human Origins in the Department of Earth 
Sciences, Natural History Museum, London 
SW7 5BD, UK. 

e-mail: c.stringer@nhm.ac.uk 


1. Dawson, C., Woodward, A. S. & Smith, G. E. 
Q. J. Geol. Soc. Lond. 69, 117-152 (1913). 
2. Weiner, J. S., Oakley, K. P. & Le Gros Clark, W. E. 
Bull. Br. Mus. Nat. Hist. 2, 139-146 (1953). 
3. Weiner, J. S. et al. Bull. Br. Mus. Nat. Hist. 
2, 225-287 (1955). 
4. Weiner, J. S. & Stringer, C. The Piltdown Forgery 
(Oxford Univ. Press, 2003). 
5. Russell, M. Piltdown Man: The Secret Life 
of Charles Dawson and the World’s Greatest 
Archaeological Hoax (Tempus, 2003). 
. McNabb, J. Archaeol. J. 163, 1-41 (2006). 
. Thackeray, J. F. Trans. R. Soc. S. Afr. 66, 9-13 
(2011). 
. Gardiner, B. G. Zool. J. Linn. Soc. 139, 315-335 
(2003). 
. Stringer, C. Homo Britannicus (Penguin, 2007). 


NOD 


Oo © 


13 DECEMBER 2012 | VOL 492 | NATURE | 179 


© 2012 Macmillan Publishers Limited. All rights reserved 


Know when your 
numbers are significant 


Experimental biologists, their reviewers and their publishers must grasp basic 
Statistics, urges David L. Vaux, or sloppy science will continue to grow. 


he incidence of papers in cell and 
| molecular biology that have basic 
statistical mistakes is alarming. I see 
figures with error bars that do not say what 
they describe, and error bars and P values for 
single, ‘representative’ experiments. So, as an 
increasingly weary reviewer of many a biol- 
ogy publication, I’m going to spell out again’ 
the basics that every experimental biologist 
should know. 

Simply put, statistics and error bars should 
be used only for independent data, and not 
for identical replicates within a single experi- 
ment. Because science represents the knowl- 
edge gained from repeated observations or 
experiments, these have to be performed 
more than once — or must use multiple 
independent samples — for us to have con- 
fidence that the results are not just a fluke, a 
coincidence or a mistake. To show only the 
result of a single experiment, even if it is a 
representative one, and then misuse statistics 


to justify that decision, erodes the integrity 
of the scientific literature. 

It is eight years since Nature adopted a 
policy of insisting that papers containing fig- 
ures with error bars describe what the error 
bars represent”. Nevertheless, it is still com- 
mon to find papers in most biology journals 
— Nature included — that contain this and 
other basic statistical errors. In my opinion, 
the fact that these scientifically sloppy papers 
continue to be published means that the 
authors, reviewers and editors cannot com- 
prehend the statistics, that they have not read 
the paper carefully, or both. 

Why does this happen? Most cell and 
molecular biologists are taught some 
statistics during their high-school or under- 
graduate years, but the principles seem to 
be forgotten somewhere between gradua- 
tion and starting in the lab. Often, the type 
of statistics they learnt is not relevant to the 
kinds of experiment they are now doing. 
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And, once in the lab, people generally just 
do what everyone else does, without always 
understanding why. 

Even if experimental biologists do not 
need to use statistical evidence for their 
own experiments, they should have an 
understanding of the basics so that they 
can interpret others’ work critically. They 
don’t all need to understand complex sta- 
tistics, or to hire professional statisticians, 
but there would be fewer sloppy papers if 
every author, reviewer and editor under- 
stood statistical concepts such as stand- 
ard deviation, standard error of the mean 
(s.e.m.), sampling error and the difference 
between replicate and independent data 
(see ‘Statistics glossary’). 


BACK TO BASICS 

In the life sciences there are typically two 
types of publication: those that use large data 
sets and rely mostly or wholly on statistical 


ILLUSTRATION BY PETE ELLIS/DRAWGOOD.COM 


evidence (for example, epidemiology, 
psychology, clinical trials and genome-wide 
association studies), and those that do not 
— such as much cell and molecular biology, 
biochemistry and classical genetics. 

For papers with large data sets that rely 
purely on statistical evidence, recommen- 
dations exist for computing sample size, 
reporting on outlying results and other 
issues*“, But these guidelines do not serve 
authors of the other category of papers. Cell 
and molecular biologists have the luxury 
of being able to probe their experimental 
systems in multiple, independent ways and 
can therefore often get by with Ns of three, 
without the need for sophisticated statistics. 

The first figure in a typical paper in cell or 
molecular biology, for example, might show 
the difference in phenotype between three 
wild-type and three gene-deleted mice. The 
second figure might compare the levels of 
proteins in cells derived from the mice, look- 
ing at both the deleted protein and one of its 
substrates, or the effects of treating wild-type 
cells with an inhibitor of the protein encoded 
by the deleted gene. If the evidence from these 
experiments is consistent, and gives support 
to acoherent model, it would be unnecessary 
to analyse 30 mice of each type, or to repeat 
the Western blots of protein levels 30 inde- 
pendent times. Watson and Crick’s paper on 
the structure of DNA° does not contain statis- 
tics, graphs with error bars or large Ns. 

Understanding the rudiments of statistics 
would stop experimental biologists from 
calculating a P value and a s.e.m. from trip- 
licates from one representative experiment, 
and might stop the reviewers and editors 
from letting these pass unquestioned. If the 
results from one representative experiment 
are shown, then N= 1 and statistics do not 
apply. Besides, it is always better to include a 
full data set, rather than withholding results 
that are not representative. When N is only 
2 or 3, it would be more transparent to just 
plot the independent data points, and let the 
readers interpret the data for themselves, 
rather than showing possibly misleading 
P values or error bars and drawing statisti- 
cal inferences. 

If the data in an experiment are equivocal, 
or the effect size is small, it is much better 
to come up with an extra, mechanistically 
different, experiment to test the hypothesis, 
than to repeat the same experiment until P is 
less than 0.05. 

If statistics are shown, it should be for 
a good reason. Descriptive statistics, such 
as range or standard deviations, are only 
necessary when there are too many data 
points to visualize easily. Inferential sta- 
tistics (an s.e.m., confidence interval or 
P value) should be shown only if they 
make it easier to interpret the results, and 
they should not detract from other key 
considerations such as the magnitude of 


Common uses 


Describing how broadly the sample values are 
distributed. 


STATISTICS GLOSSARY 
Some common statistical concepts and their uses in analysing experimental results. 
Term Meaning 
Standard deviation | The typical difference 
(s.d.) between each value and the 
mean value. 


An estimate of how variable 
the means will be if the 
experiment is repeated 
multiple times. 


With 95% confidence, the 
population mean will lie in 
this interval. 


Standard error of 
the mean (s.e.m.) 


Confidence interval 
(Cl; 95%) 


Independent data Values from separate 


that are not linked. 


Replicate data Values from experiments 


much as possible. 


Sampling error 
part of a population rather 
than measuring the whole 
population. 


experiments of the same type 


where everything is linked as 


Variation caused by sampling 


s.d.=V (> (x-mean)?/(N-1)) 


Inferring where the population mean is likely to 
lie, or whether sets of samples are likely to come 
from the same population. 

s.e.m.=s.d./vVN 


To infer where the population mean lies, and to 
compare two populations. 
Cl=meants.e.m. x twy-1 


Testing hypotheses about the population. 


Serves as an internal check on performance of an 
experiment. 


Can reveal bias in the data (if it is too small) 

or problems with conduct of the experiment 

(if it is too big). In binomial distributions (such 
as live and dead cell counts) the expected s.d. 
is V (Nxpx(1—p)); in Poisson distributions (for 
example, cells per field) the expected s.d. is 

v mean. 


N, number of independent samples; t, the t-statistic; p, probability. 


the effects or their biological significance. 

Figure legends should state the number 
of independent data points and, for experi- 
ments in which replicates were performed, 
only the mean of the replicates should be 
shown as a single independent data point. 
For replicates, no statistics should be shown, 
because they give only an indication of 
the fidelity with which the replicates were 
created: they might indicate how good the 
pipetting was, but they have no bearing on 


the hypothesis being 

(t7 Ed 
Experimental _ tested’. 

biologists All experimen- 
should know tal biologists and all 
what sort those who review 
of sampling their papers should 
errors are to know what sort of 


sampling errors are to 
be expected in com- 
mon experiments, such as determining the 
percentages of live and dead cells or count- 
ing the number of colonies on a plate or cells 
in a microscope field. Otherwise, they will 
not be able to judge their own data critically, 
or anyone else's. 


be expected.” 


REPEAT AFTER ME 

How can the understanding and use of 
elementary statistics be improved? Young 
researchers need to be taught the practicali- 
ties of using statistics at the point at which 
they obtain the results of their very first 
experiments. 

To encourage established researchers 
to use statistics properly, journals should 
publish guidelines for authors, reviewers 
and editors on the use and presentation 
of data and statistics that are relevant to 


the fields they cover. All journals should 
follow the lead of the Journal of Cell Biol- 
ogy’ and make a final check of all figures in 
accepted papers before publication. They 
should refuse to publish papers that contain 
fundamental errors, and readily publish 
corrections for published papers that fall 
short. This requires engaging reviewers who 
are statistically literate and editors who can 
verify the process. Numerical data should 
be made available either as part of the paper 
or as linked, computer-interpretable files so 
that readers can perform or confirm statisti- 
cal analyses themselves. 

When William Strunk Jr, a professor of 
English, was faced with a flood of errors in 
spelling, grammar and English usage, he 
wrote a short, practical guide that became 
The Elements of Style (also known as Strunk 
and White)*. Perhaps experimental biologists 
need a similar booklet on statistics. m 


David L. Vaux is professor of cell biology 
at the Walter and Eliza Hall Institute of 
Medical Research and at the University 
of Melbourne, Parkville, Victoria 3052, 
Australia. 

e-mail: vaux@wehi.edu.au 
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| COMMENT | BOOKS & ARTS 


ih ene 


Anurse takes blood from the heel of a newborn baby to screen for the amino-acid disorder phenylketonuria. 


Testing infant 


destinies 


Henry Greely hails a study examining California’s 
experience of mandatory newborn genetic screening. 


t the heart of Saving Babies? are the 
A™ consultation rooms in a South- 

ern California academic medical 
centre, crowded with families, genetics staff, 
medical students and “nosy ethnographers, 
who did their best to take up as little space, 
and air, as possible”. Medical sociologist 
Stefan Timmermans and medical anthro- 
pologist Mara Buchbinder may have been 
holding their breath, but their observations 
provide a fascinating portrait of the state’s 
expanded newborn-genetic-screening pro- 
gramme, and of the lives it has changed. 

In 2005, California became one of the 
first US states to adopt recommendations 
to expand its mandatory newborn genetic 
screening to more than 50 conditions, add- 
ing rarer, less well-understood ones like the 
organic acid oxidation disorder 3-MCC to 
those such as phenylketonuria (PKU) and 
sickle-cell disease. Saving Babies? covers the 
history of newborn genetic testing, but is 


based on a three-year 
study of 75 families 
who were referred, 
after screening, to an 
academic clinic for 
children’s metabolic 
disorders. The picture 
it paints is not one of 


: unalloyed success. 

Saving Babies? Timmermans and 
The Consequences — Buchbinder argue that 
op heenern the Californian sys- 
Genetic Screening Y 

STEFANTIMMERMANS tem was adopted too 
AND MARA hastily and without 
BUCHBINDER sufficient concern for 


Univ. Chicago Press: 
2012. 320 pp. $30, 
£19.50 


consequences. They 
find that unexpected 
results were common 
in the study, but that 
it is not clear how many babies received 
effective treatment as a result of the test- 
ing programme, nor how many lives were 
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saved. Quite clear, however, are the difficul- 
ties of plugging a universal “entitlement” to 
a screening test into the complex and often 
irrational US health-care system. 

The book begins with Scott and Renee 
Baio, a US celebrity couple whose daugh- 
ter Bailey tested positive for glutaric aci- 
duria type 1 (GA1), a condition involving 
an inability to process certain amino acids. 
Her test turned out to bea false positive, but 
during the harrowing wait to find out, her 
father pledged that if Bailey did not have 
the disease he would support children who 
did. Surprisingly, in light of the distress that 
accompanied the testing process, the Baios 
became strong advocates for newborn test- 
ing. Their experience of the uncertainties of 
genetic testing sets the stage for the rest of 
the book. 

The historical overview of newborn 
screening is excellent, covering both its 
1960s origins with PKU screening and the 
programme’s expansion throughout the 
United States, starting in 2005. At least three 
factors drove the expansion. One was the 
discovery that an existing technology, tan- 
dem mass spectroscopy, could be used to 
gauge the levels of many different proteins 
simultaneously, and hence cheaply. Another 
was pressure from organizations focusing on 
genetic diseases, such as the March of Dimes. 
And a third was an influential set of recom- 
mendations from the American College 
of Medical Genetics (ACMG), supporting 
expanded screening. 

Timmermans and Buchbinder power- 
fully convey how the screening programme 
converted some children into “patients-in- 
waiting” One child in their study underwent 
repeated rounds of GA1 testing while his 
parents remained uncertain whether their 
child was actually a ‘patient. This uncer- 
tainty, in part a function of the tests but also 
of our limited knowledge about many of the 
newly added conditions, can affect the lives 
of children and parents profoundly. 

The authors show how new evidence from 
the screening programme changed medical 
understanding of some of the conditions 
tested for, and how clinicians used shift- 
ing definitions of ‘normal to lead parents 
through various stages of their children’s 
conditions. The authors examine the Cali- 
fornian programme’ record in disease pre- 
vention and saving lives, goals that they show 
were inevitably compromised by flaws in the 
health-care system. They conclude that, at 
least for the metabolic diseases added into 
expanded screening, the results are not clear. 

Saving Babies? isn't perfect. I would have 
liked some exploration of why, in its 2005 
report, the ACMG 


moved from being cau- 2 NATURE.COM 
tious about neonatal  Forfetal 
screening to enthusias- _ diagnostics, see: 
ticaboutitsexpansion. —_ go.natulre.com/Ixwilzu 
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The book may also suffer from a kind of 
sampling bias. The families the authors saw 
most were those trapped in the most difficult 
situations — of either medical uncertainty or 
medical danger. They saw less of families for 
whom the screening was beneficial, particu- 
larly those whose children were diagnosed 
with a condition already in the screening 
programme before 2005. So the picture may 
be unduly gloomy. Finally, more discussion 
of how newborn screening is (or is not) done 
outside the United States would have contex- 
tualized this California-based study. 

The book does highlight some broader 
problems about scholarly work in this area. 
The first is its audience. Although, happily, 
the book is largely jargon-free, readers with- 
out much genetic knowledge may struggle 
to follow the valuable information for shap- 
ing screening policy, or be frustrated by the 
medical-sociology focus on observation 
rather than an assessment of alternative 
policies. Good policy will require input from 
many disciplines. All of us, from whatever 
discipline, need to work hard to make our 
findings useful to others. 

The second problem is time. After the 
2005 adoption of expanded screening in 
California, the authors studied families 
from November 2007 through to July 2010. 
The book will be read in 2013. That time lag 
was largely unavoidable, but by the time the 
programme can be assessed, it and its set- 
tings are no longer the same. The screening 
programme itself has already changed our 
understanding of some of the conditions 
in ways that undercut some of the authors’ 
findings. Meanwhile, the US health-care 
system is in the process of transformation. 

And newer challenges are at hand. Today, 
whole-exome and whole-genome sequenc- 
ing are being used clinically, often for chil- 
dren with conditions as poorly understood 
as those described here. With the plunging 
cost of sequencing, I think newborn sequence 
screening will not be far behind. And the 
challenges will not wait for a baby’s birth. 
In the United States and elsewhere, several 
companies are already using sequencing tech- 
nologies for prenatal testing for aneuploidies 
(disorders in which the fetus has the wrong 
number of chromosomes) using a simple 
maternal blood draw within the first trimes- 
ter; wider applications are surely coming. 

How can we implement and assess new 
technologies competently when the world is 
changing too fast for the assessors? That, to 
me, is the most disturbing question raised by 
this excellent book. = 


Henry T. Greely is the Deane E and Kate 
Edelman Professor of Law; professor (by 
courtesy) of genetics; and director of the 
Center for Law and the Biosciences at 
Stanford University, California. 

e-mail: hgreely@stanford.edu 


Books in brief 


The Artful Species: Aesthetics, Art, and Evolution 

Stephen Davies OXFORD UNIV. PRESS 320 pp. £25 (2012) 

This spare and elegant treatise by philosopher of aesthetics Stephen 
Davies posits that art is part of human nature, and is tied ina number 
of ways to human evolution. Moreover, he argues, the evidence could 
stretch back at least 400,000 years — to a blood-red quartzite hand 
axe dubbed Excalibur by the archaeologists who dug it up. Davies 
marshals findings in disciplines ranging from neuroscience, ethology 
and evolutionary biology to the arts, musicology and literature. 
Ultimately, he says, our artistic behaviour is both “puzzling and 
magnificent”, as we shoulder the heavy costs with perennial zeal. 


Mastermind: How to Think Like Sherlock Holmes 

Maria Konnikova VIKING 288 pp. $26.95 (2013) 

Devotees of Arthur Conan Doyle’s conundrum-cracker will be thrilled 
by this portmanteau of strategies for sharpening cognitive ability. 
Scientific American columnist Maria Konnikova mixes psychology and 
neuroscience with Holmesian technique and insights on everything 
from information storage (Holmes’s ‘brain attic’) to observation, 
awareness and razor-sharp deduction. A few hours in Konnikova’s 
company and, along with Holmes, you might intone, “give me the 
most abstruse cryptogram or the most intricate analysis, and | am in 
my own proper atmosphere” (The Sign of Four, 1890). 


: : The Fragile Wisdom: An Evolutionary View on Women’s Biology 
{hus Feet saga and Health 

; one Grazyna Jasienska HARVARD UNIV. PRESS 298 pp. £25.95 (2013) 
Women may aim for perfect health through diet, exercise and 
close attention to medical advice, but still develop breast cancer or 
osteoporosis. Reproductive fitness often wars with general physical 
fitness over a womans lifetime, argues public-health specialist 
Grazyna Jasienska. Drawing on a raft of research in evolutionary 
biology and beyond, she points to factors such as the disjunction 
between ‘palaeo’ and current lifestyles, hormonal disparities and 
longer lifespans as key to informing disease-prevention strategies. 


Mankind Beyond Earth: The History, Science, and Future of Human 
Space Exploration 

Claude A. Piantadosi COLUMBIA UNIV. PRESS 336 pp. $35 (2013) 
Despite difficulties such as cosmic radiation, huge distances, near- 
vacuum conditions and zero gravity, manned space flight still ignites 
the imaginations of millions. Medical doctor Claude Piantadosi 

fans the flames by boldly going into the past and possible future of 
US space exploration. This is a chronicle at warp speed, covering 
the science of space exploration; robots, spacecraft and the 
International Space Station; NASA's glory years; and the constraints 
on kick-starting ‘cheap’ space transportation. 


The Science of Middle-earth 

Henry Gee JILL GRINBERG LITERARY MANAGEMENT Available for 

Kindle only $4.99 (2012) 

Repeat immersions in Middle-earth beckon again as Peter Jackson’s 
first instalment of The Hobbit trilogy, An Unexpected Journey, opens 
in cinemas. Nature senior editor Henry Gee offers a revised and 
reissued guide to the science in J. R. R. Tolkien’s fictional world. This 
is acompletist’s feast, from the glow of Bilbo’s Elvish blade Sting 
(possibly the result of a chemical sensor “specifically tuned to Orkish 
exhalations”) to the aerodynamic unfeasibility of a Balrog’s wings. 
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The book may also suffer from a kind of 
sampling bias. The families the authors saw 
most were those trapped in the most difficult 
situations — of either medical uncertainty or 
medical danger. They saw less of families for 
whom the screening was beneficial, particu- 
larly those whose children were diagnosed 
with a condition already in the screening 
programme before 2005. So the picture may 
be unduly gloomy. Finally, more discussion 
of how newborn screening is (or is not) done 
outside the United States would have contex- 
tualized this California-based study. 

The book does highlight some broader 
problems about scholarly work in this area. 
The first is its audience. Although, happily, 
the book is largely jargon-free, readers with- 
out much genetic knowledge may struggle 
to follow the valuable information for shap- 
ing screening policy, or be frustrated by the 
medical-sociology focus on observation 
rather than an assessment of alternative 
policies. Good policy will require input from 
many disciplines. All of us, from whatever 
discipline, need to work hard to make our 
findings useful to others. 

The second problem is time. After the 
2005 adoption of expanded screening in 
California, the authors studied families 
from November 2007 through to July 2010. 
The book will be read in 2013. That time lag 
was largely unavoidable, but by the time the 
programme can be assessed, it and its set- 
tings are no longer the same. The screening 
programme itself has already changed our 
understanding of some of the conditions 
in ways that undercut some of the authors’ 
findings. Meanwhile, the US health-care 
system is in the process of transformation. 

And newer challenges are at hand. Today, 
whole-exome and whole-genome sequenc- 
ing are being used clinically, often for chil- 
dren with conditions as poorly understood 
as those described here. With the plunging 
cost of sequencing, I think newborn sequence 
screening will not be far behind. And the 
challenges will not wait for a baby’s birth. 
In the United States and elsewhere, several 
companies are already using sequencing tech- 
nologies for prenatal testing for aneuploidies 
(disorders in which the fetus has the wrong 
number of chromosomes) using a simple 
maternal blood draw within the first trimes- 
ter; wider applications are surely coming. 

How can we implement and assess new 
technologies competently when the world is 
changing too fast for the assessors? That, to 
me, is the most disturbing question raised by 
this excellent book. = 


Henry T. Greely is the Deane E and Kate 
Edelman Professor of Law; professor (by 
courtesy) of genetics; and director of the 
Center for Law and the Biosciences at 
Stanford University, California. 

e-mail: hgreely@stanford.edu 


Books in brief 


The Artful Species: Aesthetics, Art, and Evolution 

Stephen Davies OXFORD UNIV. PRESS 320 pp. £25 (2012) 

This spare and elegant treatise by philosopher of aesthetics Stephen 
Davies posits that art is part of human nature, and is tied ina number 
of ways to human evolution. Moreover, he argues, the evidence could 
stretch back at least 400,000 years — to a blood-red quartzite hand 
axe dubbed Excalibur by the archaeologists who dug it up. Davies 
marshals findings in disciplines ranging from neuroscience, ethology 
and evolutionary biology to the arts, musicology and literature. 
Ultimately, he says, our artistic behaviour is both “puzzling and 
magnificent”, as we shoulder the heavy costs with perennial zeal. 


Mastermind: How to Think Like Sherlock Holmes 

Maria Konnikova VIKING 288 pp. $26.95 (2013) 

Devotees of Arthur Conan Doyle’s conundrum-cracker will be thrilled 
by this portmanteau of strategies for sharpening cognitive ability. 
Scientific American columnist Maria Konnikova mixes psychology and 
neuroscience with Holmesian technique and insights on everything 
from information storage (Holmes’s ‘brain attic’) to observation, 
awareness and razor-sharp deduction. A few hours in Konnikova’s 
company and, along with Holmes, you might intone, “give me the 
most abstruse cryptogram or the most intricate analysis, and | am in 
my own proper atmosphere” (The Sign of Four, 1890). 


: : The Fragile Wisdom: An Evolutionary View on Women’s Biology 
{hus Feet saga and Health 

; one Grazyna Jasienska HARVARD UNIV. PRESS 298 pp. £25.95 (2013) 
Women may aim for perfect health through diet, exercise and 
close attention to medical advice, but still develop breast cancer or 
osteoporosis. Reproductive fitness often wars with general physical 
fitness over a womans lifetime, argues public-health specialist 
Grazyna Jasienska. Drawing on a raft of research in evolutionary 
biology and beyond, she points to factors such as the disjunction 
between ‘palaeo’ and current lifestyles, hormonal disparities and 
longer lifespans as key to informing disease-prevention strategies. 


Mankind Beyond Earth: The History, Science, and Future of Human 
Space Exploration 

Claude A. Piantadosi COLUMBIA UNIV. PRESS 336 pp. $35 (2013) 
Despite difficulties such as cosmic radiation, huge distances, near- 
vacuum conditions and zero gravity, manned space flight still ignites 
the imaginations of millions. Medical doctor Claude Piantadosi 

fans the flames by boldly going into the past and possible future of 
US space exploration. This is a chronicle at warp speed, covering 
the science of space exploration; robots, spacecraft and the 
International Space Station; NASA's glory years; and the constraints 
on kick-starting ‘cheap’ space transportation. 


The Science of Middle-earth 

Henry Gee JILL GRINBERG LITERARY MANAGEMENT Available for 

Kindle only $4.99 (2012) 

Repeat immersions in Middle-earth beckon again as Peter Jackson’s 
first instalment of The Hobbit trilogy, An Unexpected Journey, opens 
in cinemas. Nature senior editor Henry Gee offers a revised and 
reissued guide to the science in J. R. R. Tolkien’s fictional world. This 
is acompletist’s feast, from the glow of Bilbo’s Elvish blade Sting 
(possibly the result of a chemical sensor “specifically tuned to Orkish 
exhalations”) to the aerodynamic unfeasibility of a Balrog’s wings. 
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In the hit US television show Watch Mr. Wizard, actor Don Herbert (right) presented a weekly experiment to teach children about aspects of science. 


SCIENCE COMMUNICATION 


Science fresh from the box 


Eugenie Scott relishes a history of science programming on US television that reveals 
an often intrusive emphasis on thrills and chills. 


etween the start of the Second World 
War and the 1990s, US television 
evolved from a novel box with a hand- 
ful of network channels in flickering black 
and white to a cultural necessity: all flat 
screens in brilliant colour with hundreds of 
broadcast, satellite and cable channel choices. 
In Science on American Television, historian 
Marcel Chotkowski LaFollette tracks and 
analyses how television treated science over 
those crucial 50 years — a follow-up to her 
2008 analysis of science programmes on 
radio and early television, Science on the Air 
(University of Chicago Press). 
Entertainment is the key to understand- 
ing the relationship between science and 


Science on American 
Television: A History 
MARCEL CHOTKOWSKI 
LAFOLLETTE 

University of Chicago 
Press: 2012. 296 pp. 
$45 


television. Towards the end of her well- 
researched, thought-provoking book, 
LaFollette explains, “The need to attract 
the largest possible audiences pushed tele- 
visions version of science, whether intended 
as education or fiction, ever more toward 
sensationalization, politics, celebrities, 
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and representation and away from ... the 
thought and reasoning behind scientific 
conclusions and recommendations.” 

When the medium first emerged in the 
United States in the 1930s and 1940s, it fol- 
lowed a path already blazed by radio. The 
government licensed frequencies to com- 
mercial entities that paid a fee for broad- 
cast rights, but also sold advertising and 
provided entertaining content in which to 
intersperse it. Under this commercial model, 
governmental efforts to increase the qual- 
ity of broadcasts were sporadic and mostly 
anaemic. Attracting listeners and viewers 
became paramount. LaFollette notes that 
British radio and television, by contrast, 


NBCU PHOTO BANK/GETTY 


NEAL PRESTON/CORBIS 


were supported and supervised by the 
government, which established standards 
for quality and diversity of programming. 

Early on, says LaFollette, US networks 
experimented, mixing news and entertain- 
ment with science programmes and other 
cultural and educational fare. By modern 
standards, much of the science would be 
considered dry, wordy and stiff, as earnest 
scientists and science teachers declaimed 
while tapping blackboards with pointers. 

Jazzier representations were on offer, how- 
ever. The sitcom Mister Peepers (1952-55) 
starred Wally Cox as a science teacher peer- 
ing through thick glasses — stereotypically 
smart but nerdy, and falling into absurd 
fixes. In the hit show Watch Mr. Wizard 
(1951-65), science-obsessed actor Don 
Herbert, with the help ofa shifting coterie 
of child actors, tackled a weekly experiment, 
often of considerable sophistication. 

A standout mixture of science, anima- 
tion and live actors was the Bell Laboratory 
Science Series. It began in 1956 with the 
Frank Capra-directed Our Mr. Sun, a con- 
versation among a scientist, a writer, the 
Sun and other characters, many of them 
animated. (Capra, who shot classics such as 
It’s a Wonderful Life (1946), had a degree in 
chemical engineering from the California 
Institute of Technology.)The most popular 
of these films were seen by an estimated 
one-third of the television audience — about 
24 million North Americans. But they were, 
first and foremost, entertainment. 

In search of entertainment, the presenta- 
tion of science on television grew increas- 
ingly artificial. Disney’s True-Life Adventures 
nature films, recut for television, engaged 
young people with exotic species and 
regions. But the films anthropomorphized 
the animals and overshadowed the facts. 
One, In Beaver Valley (1950), shows beavers 
labouring with “stubborn resolve’, while 
“carefree” otters slide down snowy hills to 
zany music. Fictional medical doctors from 
Marcus Welby to Dr Kildare were heavy on 
charisma and light on science. LaFollette 
shows how even news reportage on the AIDS 
epidemic in the 1980s changed perspective, 
from scientific accuracy and searches for a 
cure to human interest, as homophobia and 
political debate took their toll. 

Not all is bleak in LaFollette’s history. 
The Public Broadcasting Service (PBS), a 
non-commercial network partly supported 
by the government and founded in 1969, is 
the natural home of art, opera, ballet, clas- 
sical music — and science. Several series 
have given brilliant science communica- 
tors their say, such as Carl Sagan in the 
1980 Cosmos, on the nature and history of 
the Universe and life, and Philip Morrison 
in the 1987 The Ring of Truth, on the process 
(and excitement) of scientific discovery. Bill 
Nye the Science Guy (1993-98) is a model of 
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Fictional forensics on the job in CSI: Crime Scene Investigation. 


outreach to young people. 

Meanwhile, imported programmes such 
as The Nature of Things, starring Canadian 
David Suzuki, and many from the BBC have 
been beacons in the gloom. Also strong are 
the Nova documentaries produced by Boston 
PBS station WGBH since 1974 and featur- 
ing compelling presenters such as astrono- 
mer Neil deGrasse Tyson or biologist Sean 
B. Carroll. Even the commercial series CSI 
models scientific reasoning as actors hypoth- 
esize, eliminate alternatives and finally settle 
on the most probable explanation. 

When cable stations underwent their 
primary adaptive radiation in the 1980s 
and 1990s, science fans hoped that the ‘long 
tail of specialized programming would 
produce networks serving their interests. 
But even the National Geographic and Dis- 
covery channels have proved disappointing 
— presenting a mishmash of actual science 
and breathless forays into cryptozoology, 
crank medicine and paranormal phenom- 
ena. The network TLC’s similar transition 
in focus from quality history, science, news, 
and technology to fluffier reality-based 


programming is disheartening, given that it 
was set up by NASA and the US Department 
of Health, Education, and Welfare, and was 
once known as The Learning Channel. 

If television is a vast wasteland, as the chair 
of the Federal Communications Commis- 
sion famously said in 1961, it is especially 
inhospitable to science. LaFollette analyses 
attempts by the Smithsonian Institution, 
the American Association for the Advance- 
ment of Science and other scientific bodies 
to engage with television. These have often 
failed. The culture of professional scientists 
and the culture of an entertainment medium 
are destined to clash. However, we must find 
a way to improve the situation while working 
within commercial constraints. Americans 
spend about half of their leisure time in front 
of the descendants of those flickering boxes 
of the 1940s. Wasteland or not, television is 
simply too important to ignore. = 


Eugenie Scott is the executive director of 
the National Center for Science Education in 
Oakland, California, USA. 

e-mail: scott@ncse.com 
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End the scandal of 
false cell lines 


Cell-line misidentification is 

a continuing problem (see, for 
example, Nature 457, 935-936; 
2009), with an estimated 15% 

or more of human cell lines not 
being derived from the claimed 
source. This means that billions 
of dollars have been wasted over 
the past 45 years on producing 
misleading or false data. Although 
a few progressive journals and 
funding bodies demand that 
cell lines be authenticated, 

this practice needs to become 
standard and universal. 

Cross-contamination between 
human cell lines was first 
described in 1967, yet these 
same cell lines continue to be 
used in published studies under 
their false names and tissue 
attributions (see Nature Rev. 
Cancer 10, 441-448; 2010). 

The method used to 
authenticate human cell lines is 
called short-tandem-repeat (STR) 
profiling, and a standard released 
by the American National 
Standards Institute provides 
protocols (see go.nature.com/ 
cijups). The International Cell 
Line Authentication Committee 
(ICLAC) has produced helpful 
hints for testing authenticity (see 
go.nature.com/utig5g) and for 
incorporating authentication into 
good tissue-culture practice (see 
go.nature.com/jgip8v). 

A rapidly expanding list that 
already contains more than 
400 misidentified cell lines is 
freely accessible (see go.nature. 
com/soppaj) so that anyone can 
check whether a cell line has 
already been identified as false. 
Also, the US National Center for 
Biotechnology Information is 
developing a database of reference 
STR profiles of human cell lines 
(see go.nature.com/edfmc)). 

Given that these resources are 
now available to the scientific 
community, and once all 
journals and funding bodies 
make authentication testing 
compulsory, the scandalous use of 
misidentified human cells should 
finally become a thing of the past. 


John R. Masters* University 
College London, UK. 
j.masters@ucl.ac.uk 

*On behalf of the ICLAC. For 

a full list of signatories and 
competing financial interests, see 
go.nature.com/7vadce. 


Settle discord over 
the Southern Ocean 


It is alarming that the 
Commission for the 
Conservation of Antarctic 
Marine Living Resources 
(CCAMLR) failed to reach 
consensus over proposals to 
establish any new large-scale 
marine protected areas (MPAs) 
in the Southern Ocean this 

year (Nature http://doi.org/jxb; 
2012). Some members set a 
worrying precedent by trying to 
dismiss advice previously agreed 
by the commission’s scientific 
committee and working groups. 
The CCAMLR must deliver on 
its commitments, to uphold 

its reputation and convince 

the world that protecting the 
Antarctic marine ecosystem 
remains paramount. 

The CCAMLR was committed 
to establishing MPAs by 2012, 
having designated its first 
MPA in 2009. Since then, 
the viewpoints of CCAMLR 
member states have polarized 
(see CCAMLIR report XXX]; 
go.nature.com/fqqpdh). These 
fishing nations may fear that new 
MPAs could deny them future 
economic opportunities. 

Food security is a vital 
global issue, and the Southern 
Ocean, home to one of the 
last underdeveloped sources 
of marine protein, Antarctic 
krill, could be critical. 
Governments need to recognize 
that conservation must go 
hand in hand with sustainable 
harvesting. 

To ensure that their history 
of cooperation continues, all 
CCAMLR members should 
engage in the development 
process for MPA proposals, 
evaluating threats to biodiversity 
and identifying mutually 
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desirable outcomes. Joint 

MPA proposals are needed 

that incorporate data and 
analyses from as many members 
as possible. Input from 
non-governmental organizations 
that focus on development, as 
well as from conservationists, 
would help. 

Philip N. Trathan British 
Antarctic Survey, Natural 
Environment Research Council, 
Cambridge, UK. 

pnt@bas.ac.uk 


Romania needs 
overseas reviewers 


International review has 

had a positive influence on 
funding decisions in Romanian 
universities. As members of 
the first overseas grant-review 
panel to operate in the country, 
we are therefore concerned to 
learn that the research minister 
intends to discontinue the use 
of international referees (Nature 
http://doi.org/jwn; 2012) ina 
scheme originally drawn up to 
improve the country’s scientific 
and ethical standing. 

Every proposal we adjudicated 
— for postdoctoral fellowships or 
grants for early-stage researchers 
and senior principal investigators 
— was evaluated by at least three 
anonymous international experts 
and by external rapporteurs. 
These panels each examined 
about 100 applications, with 
particular emphasis on the merit 
of proposals and on the integrity 
of CVs. We were impressed 
by the overall quality of this 
innovative system. 

Young Romanian scientists 
will be heartened to learn that 
this round of grant funding 
met all international standards 
of scrutiny and peer review. 

It will be most unfortunate 

for Romania to thwart this 
progress towards improving its 
international research status. 
Illimar Altosaar* University of 
Ottawa, Canada. 
altosaar@uottawa.ca 

*On behalf of 9 co-authors (see 
go.nature.com/m7rdby). 


Put soil security on 
the global agenda 


An international coalition of 
scientists has launched the Soil 
Carbon Initiative (see go.nature. 
com/onhgcv), convened by the 
United States Studies Centre 
and the Faculty of Agriculture 
at the University of Sydney in 
Australia, to bring policy on 
soil security in line with that on 
food and water security. 

Soil security refers to the 
maintenance and improvement 
of soils worldwide so that 
they can continue to provide 
food, fibre and fresh water, 
contribute to energy and 
climate sustainability and help 
to maintain biodiversity and 
protect ecosystem goods and 
services. 

Soil carbon needs to be 
monitored and managed — as 
was discussed during Global 
Soil Week in Berlin last month, 
at a meeting organized by 
the Institute for Advanced 
Sustainability Studies in 
Potsdam, Germany, and its 
international partners (see 
www.globalsoilweek.org). Also 
on the agenda were topics such 
as carbon trading and ‘zero net 
land degradation’ (see go.nature. 
com/dly4g3). 

Although scientists have 
made significant inroads into 
understanding soil function, 
international policy must keep 
pace with these advances. 
Andrea Koch, Alex McBratney 
University of Sydney, Australia. 
andrea. koch@sydney.edu.au 
Rattan Lal Ohio State University, 
Columbus, Ohio, USA. 
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A constructive debate 


Synthetic chemistry has long been used to prepare useful compounds — especially those that are hard to obtain from 
natural sources. But synthetic biology is coming of age as an alternative strategy. A biologist and two chemists debate the 


merits of their fields’ synthetic prowess. 


Building with 
biology 


SL. biology is essentially the assembly 
of well-characterized biological compo- 
nents into a system that performs a function, 
such as synthesizing a chemical. The field has 
advanced to the point that one can imagine 
producing nearly any organic molecule — 
even those that are not produced naturally 
— in an engineered microorganism. This has 
enormous implications for the production of 
speciality and bulk chemicals, drugs and fuels. 

Structurally complex pharmaceutical ingre- 
dients based on natural products are particu- 
larly good targets for microbial production 
(Fig. 1a), because they can be difficult to pro- 
duce by conventional chemical synthesis. Even 
when chemical syntheses for natural products 
are available, the routes used are often too long 
and/or low-yielding for large-scale prepara- 
tion. For commercial production, such mole- 
cules are therefore typically harvested from 
organisms that produce them naturally, or 
from a mutant that generates higher yields. 
Alternatively, a semi-synthesis can be used 
in which a precursor to a desired compound 
is obtained from an organism and then con- 
verted to the final product using organic 
synthesis. However, these approaches tend to 
be time-consuming and expensive. 

Naturally occurring compounds can be 
produced in microorganisms by transferring 
product-specific enzymes, or even whole 
metabolic pathways, from rare and/or geneti- 
cally intractable organisms to those that can 
be readily engineered’. Similarly, fuels, bulk 
chemicals and speciality chemicals that are not 
produced naturally can be obtained by com- 
bining enzymes or metabolic pathways from 
different hosts into a single microorganism, or 
by engineering enzyme functions’. 

Synthetic biology has also been used for the 
large-scale semi-synthesis of natural products. 
For example, the antimalarial drug artemisinin 
is extracted from the plant Artemisia annua, 


Figure 1 | Reaction vessels. a, Some biologists have argued that structurally complex molecules are best 
prepared in genetically engineered organisms, such as the bacterium Escherichia coli (pictured). b, Others 
think that chemical methods will endure as the most general option for synthesizing any desired compound. 


but it is in short supply and is too expensive 
for most people with malaria’. By combining 
genes from A. annua and other organisms 
into a single strain of the yeast Saccharomyces 
cerevisiae, we have produced a fermentation 
process* to make artemisinic acid — which 
can easily be converted to artemisinin using 
chemical methods’ — from simple sugars. 
The process is cost-effective, environmentally 
friendly and reliable, and is being developed 
for commercial production of the drug. 
Synthetic biology has many advantages over 
chemical synthesis. First, the intermediates 
in a biosynthesis do not need to be purified 
before being used as substrates in the next 
reaction. Second, the many ‘protection and 
‘de-protectiom steps typical of chemical syn- 
thesis — steps in which chemical groups are 
temporarily modified to stop them taking 
part in unwanted side reactions — can be 
avoided, because biosynthetic enzymes cata- 
lyse reactions only at the required positions 
ina substrate, avoiding side reactions at other 
groups. Third, most products of enzymatic 
reactions are racemically pure (the products 
form as just one of two possible mirror-image 
isomers), which is important for biologically 
active molecules. Fourth, cells can be engi- 
neered to secrete the final product, making it 
easier to purify. Finally, synthetic biology can 
use simple starting materials from renewable 
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sources, helping to reduce our dependence on 
oil-derived feedstocks. 

Even so, if synthetic biology is to match the 
power of synthetic chemistry, several problems 
must be addressed. For example, the biosyn- 
thetic enzymes responsible for producing 
important natural products need to be iden- 
tified, so that they can be used in syntheses. 
We must also improve our ability to design 
enzymes that catalyse reactions not found in 
nature’ if we are to expand the types of chem- 
istry that can be engineered into cells. And we 
must learn how to reliably engineer biosyn- 
thetic pathways to achieve desired outcomes. 

I envisage a day when customized cells will 
be built as catalysts for the biosynthesis of 
natural products, by designing chromosomes 
that harbour genes encoding the necessary 
biosynthetic pathways and also the minimal set 
of genes needed to construct the host organ- 
ism from minimal nutrients. That day is fast 
approaching. 


Jay D. Keasling is in the Department of 
Chemical and Biomolecular Engineering, and 
the Department of Bioengineering, University 
of California, Berkeley, Berkeley, California 
94720, USA. He is also at the Lawrence 
Berkeley National Laboratory and at the Joint 
BioEnergy Institute in California. 

e-mail: keasling@berkeley.edu 
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Practical 
chemistry 


ABRAHAM MENDOZA & PHIL S. BARAN 


ince the birth of synthetic chemistry about 

180 years ago, society has loved the wonders 
bestowed by the field, such as life-saving medi- 
cines, pest control and molecules that light up 
telephone displays, yet has harshly criticized 
it as being solely responsible for pollution 
and environmental harm. Synthetic biology 
emerged as an alternative for constructing 
molecules only about ten years ago, but some 
have already proclaimed’ that it will supplant 
chemical synthesis. No one should doubt the 
usefulness of synthetic biology, or its potential 
to shorten synthetic routes and reduce waste 
in chemical production. But we are convinced 
that synthetic chemistry will continue to dom- 
inate for the foreseeable future (Fig. 1b), for 
three main reasons. 

The first reason is that chemical synthesis 
is the best way to solve supply problems. For 
decades, synthetic chemistry has provided suf- 
ficient quantities of agrochemicals, medicines, 
perfumes and materials for society's needs. The 
pharmaceutical industry in particular relies on 
chemical methods for the large-scale produc- 
tion of most small-molecule drugs. The major- 
ity of these compounds are based on molecular 
structures not found in nature, which means 
that they cannot be prepared through enzy- 
matic processes and are likely to be toxic to 
the host organisms used in synthetic biology. 
Synthetic biology has had a crucial impact on 
the commercial production of some medi- 
cines derived from complex natural products, 
such as artemisinin’ and the anticancer drug 
paclitaxel (Taxol)*. But natural products are 
essentially the only compounds for which 
biological syntheses can compete with chemi- 
cal ones, because evolution has optimized the 
biosynthesis of those products over time. 

So the supply of chemicals is best addressed 
by synthetic chemistry, unless a specific natural 
product is required in large quantities — and 
even then, semi-synthetic strategies involving 
afew chemical steps are often required. In fact, 
total chemical syntheses of natural products are 
becoming increasingly efficient and scalable, as 
demonstrated by the impressive routes used to 
make tetracycline antibiotics’ and the antican- 
cer agent eribulin”®. A practical chemical syn- 
thesis of artemisinin has also now emerged"! 
that could form the basis of an industrial-scale 
process for making the drug, and a large-scale 
synthesis of Taxol is being developed”. 

Optimizing the properties of useful 
compounds, or adapting their functions to 
new applications, often requires modifica- 
tion of their molecular structures. The second 
reason that synthetic chemistry will endure 
is that chemical methods provide a reliable 


set of tools to do this in many fundamentally 
different cases. Moreover, unlike biological 
syntheses, chemical syntheses can often be 
developed and implemented rapidly, which is 
a great advantage. 

The third reason is that chemistry excels in 
the invention of unnatural molecules that have 
desirable physical properties — such as dyes 
for printable organic solar cells, fluorescent 
probes for biological research or radiolabelled 
drugs used in medicine. The molecular needs 
of vibrant modern fields such as supramolec- 
ular chemistry, chemical biology and nano- 
technology can be addressed only by synthetic 
chemistry. This is partly because the required 
molecules contain motifs that nature can- 
not assemble or that would be toxic to host 
organisms when biosynthesized at the con- 
centrations required for a practical production 
process. The exponential pace of development 
of these fields also means that the compounds 
needed are continuously changing, limiting 
the time available to synthesize and evaluate 
them. General chemical methods that can 
be applied quickly are therefore much more 
suitable for making such compounds than 
bioengineering. 

Over the years, many people have advocated 
alternatives to synthetic chemistry or expressed 
the opinion that it is already a mature field and 


CELL BIOLOGY 


All clear for 
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that all future advances will be incremental at 
best. But the field is as lively as ever: a seem- 
ingly infinite number of problems are waiting 
to be solved, and legions of talented students 
are eager to solve them. Synthetic biology surely 
has a bright future, but no approach to making 
molecules is more generally useful and has such 
seemingly limitless potential than synthetic 
chemistry. It is here to stay. = 


Abraham Mendoza and Phil S. Baran are 
in the Department of Chemistry, The Scripps 
Research Institute, La Jolla, California 92037, 
USA. 

e-mail: pbaran@scripps.edu 
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ribosome landing 


The discovery of a dramatic structural rearrangement that is stabilized by an 
RNA scaffold helps to explain how nascent proteins are delivered for export from 


the cell cytoplasm. SEE LETTER P.271 


HARRIS D. BERNSTEIN 


rotein sorting within a cell is not an easy 
Pp undertaking. The synthesis of many 

proteins destined to leave the cell or to 
become part of a cellular membrane must be 
coupled with their transport across or into the 
appropriate membrane. A universal RNA- 
protein complex called the signal recogni- 
tion particle (SRP) plays an essential role in 
this process by interacting with ribosomes, 
the cell’s protein-synthesizing factories, while 
they are engaged in protein translation. SRP 
then escorts the translating ribosomes to a 
cellular organelle known as the endoplasmic 
reticulum or, in bacteria, to the cytoplasmic 
membrane, where they dock onto a transport 
channel called the Sec complex, or SecYEG in 
bacteria. SRP must presumably be displaced 
from the ribosome for docking to happen, but 


the mechanism of this molecular exchange 
has been an enigma. On page 271 of this issue, 
Shen et al.' provide compelling evidence thata 
remarkable conformational change driven by 
the RNA component of SRP enables the ribo- 
some to deliver its protein cargo efficiently. 
Most membrane and secreted proteins that 
emerge from translating ribosomes carry a 
signal peptide — a sequence at their amino 
terminus that earmarks them for export from 
the cytoplasm. SRP recognizes and binds to this 
peptide and then interacts with a membrane 
receptor (FtsY) before releasing its nascent- 
protein cargo and transferring the ribosome 
to SecYEG (Fig. 1). This co-translational mode 
of protein targeting ensures that proteins 
destined to leave the cytoplasm begin their jour- 
ney before they aggregate or fold into a structure 
that cannot fit through the SecYEG channel. 
Although SRP has undergone dramatic 
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ince the birth of synthetic chemistry about 

180 years ago, society has loved the wonders 
bestowed by the field, such as life-saving medi- 
cines, pest control and molecules that light up 
telephone displays, yet has harshly criticized 
it as being solely responsible for pollution 
and environmental harm. Synthetic biology 
emerged as an alternative for constructing 
molecules only about ten years ago, but some 
have already proclaimed’ that it will supplant 
chemical synthesis. No one should doubt the 
usefulness of synthetic biology, or its potential 
to shorten synthetic routes and reduce waste 
in chemical production. But we are convinced 
that synthetic chemistry will continue to dom- 
inate for the foreseeable future (Fig. 1b), for 
three main reasons. 

The first reason is that chemical synthesis 
is the best way to solve supply problems. For 
decades, synthetic chemistry has provided suf- 
ficient quantities of agrochemicals, medicines, 
perfumes and materials for society's needs. The 
pharmaceutical industry in particular relies on 
chemical methods for the large-scale produc- 
tion of most small-molecule drugs. The major- 
ity of these compounds are based on molecular 
structures not found in nature, which means 
that they cannot be prepared through enzy- 
matic processes and are likely to be toxic to 
the host organisms used in synthetic biology. 
Synthetic biology has had a crucial impact on 
the commercial production of some medi- 
cines derived from complex natural products, 
such as artemisinin’ and the anticancer drug 
paclitaxel (Taxol)*. But natural products are 
essentially the only compounds for which 
biological syntheses can compete with chemi- 
cal ones, because evolution has optimized the 
biosynthesis of those products over time. 

So the supply of chemicals is best addressed 
by synthetic chemistry, unless a specific natural 
product is required in large quantities — and 
even then, semi-synthetic strategies involving 
afew chemical steps are often required. In fact, 
total chemical syntheses of natural products are 
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demonstrated by the impressive routes used to 
make tetracycline antibiotics’ and the antican- 
cer agent eribulin”®. A practical chemical syn- 
thesis of artemisinin has also now emerged"! 
that could form the basis of an industrial-scale 
process for making the drug, and a large-scale 
synthesis of Taxol is being developed”. 

Optimizing the properties of useful 
compounds, or adapting their functions to 
new applications, often requires modifica- 
tion of their molecular structures. The second 
reason that synthetic chemistry will endure 
is that chemical methods provide a reliable 


set of tools to do this in many fundamentally 
different cases. Moreover, unlike biological 
syntheses, chemical syntheses can often be 
developed and implemented rapidly, which is 
a great advantage. 

The third reason is that chemistry excels in 
the invention of unnatural molecules that have 
desirable physical properties — such as dyes 
for printable organic solar cells, fluorescent 
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drugs used in medicine. The molecular needs 
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ular chemistry, chemical biology and nano- 
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centrations required for a practical production 
process. The exponential pace of development 
of these fields also means that the compounds 
needed are continuously changing, limiting 
the time available to synthesize and evaluate 
them. General chemical methods that can 
be applied quickly are therefore much more 
suitable for making such compounds than 
bioengineering. 

Over the years, many people have advocated 
alternatives to synthetic chemistry or expressed 
the opinion that it is already a mature field and 
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that all future advances will be incremental at 
best. But the field is as lively as ever: a seem- 
ingly infinite number of problems are waiting 
to be solved, and legions of talented students 
are eager to solve them. Synthetic biology surely 
has a bright future, but no approach to making 
molecules is more generally useful and has such 
seemingly limitless potential than synthetic 
chemistry. It is here to stay. = 
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Figure 1 | Handover of translating ribosomes. a, The signal recognition 
particle (SRP) has a protein (Ffh) component and an RNA component that 
forms a hairpin structure. Ffh recognizes and binds to signal peptides on 
nascent proteins during protein translation. b, The GTP-bound GTPase 
domain of Fth (labelled GTP) subsequently interacts with the GTPase 
domain of FtsY — a receptor that resides in the endoplasmic reticulum or 
the bacterial cytoplasmic membrane. In bacteria, FtsY recruits the SecYEG 


expansions and contractions during evolution, 
even the simplest bacterial SRPs contain two 
highly conserved core elements — a protein 
(Ffh) that binds to the signal peptide and an 
elongated, roughly 100-nucleotide RNA that 
forms a ‘hairpin. Despite this relative simplicity, 
SRP must not only distinguish signal peptides 
from the bulk of nascent polypeptides, but also 
release them at the right place and time. 

Many aspects of the protein-targeting cycle 
mediated by SRP are well understood. For 
instance, it is known that the signal-peptide- 
recognition and RNA-binding activities of 
Ffh both reside in a single domain, called the 
M domain. Also, both Ffh and FtsY contain a 
GTPase domain that hydrolyses the nucleotide 
GTP to GDP and thereby acts as a regulatory 
switch. In the presence of a signal peptide, 
SRP RNA catalyses an interaction between 
the GTPase domains of Ffh and FtsY, and 
the binding of GTP brings the two domains 
into close proximity~*. Subsequently, struc- 
tural rearrangements in the GTPase domains 
promote GTP hydrolysis, disassembly of the 
SRP-FtsY complex and dissociation of SRP 
from the ribosome*. Moreover, in the bacte- 
rium Escherichia coli, FtsY seems to recruit 
SecYEG to ensure that nascent polypeptides 
are released at the correct location’, 

Despite much progress, however, it has 
remained unclear how the transfer of ribo- 
some -nascent-protein complexes to SecYEG 
is coordinated with the enzymatic activity of 
the GTPase domain and the recycling of the 
SRP-FtsY complex. To complicate matters, SRP 
and SecYEG bind to overlapping sites on L23, 
a ribosomal protein located at the point where 
nascent polypeptides emerge from a tunnel in 
the large subunit of the ribosome®. SRP must 
therefore be reoriented without prematurely 
releasing its rlbosome-nascent-protein cargo. 

A clue as to how the transfer might occur 


and FtsY. 


emerged from a crystal structure of the SRP- 
FtsY complex’ that captured the interacting 
GTPase domains bound to a sequence near 
the distal end of the SRP RNA hairpin. This 
configuration was unexpected, because previ- 
ous work* had shown that the complex is sta- 
bilized by bonds between the FtsY GTPase and 
a 4-nucleotide turn (a tetraloop) at the top of 
the hairpin. The structure’ suggested that the 
two domains might travel together to a remote 
‘landing pad’ more than 100 angstroms away 
after first interacting at the tetraloop. Such 
spectacular molecular gymnastics would be 
possible only because the two domains of Ffh 
are connected by a flexible linker. 

To obtain direct evidence for this putative 
large-scale movement and to glean its func- 
tional significance, Shen et al. attached a dye 
to the FtsY GTPase domain and a different dye 
to the distal end of SRP RNA. They then meas- 
ured the distance between the dyes through- 
out the SRP cycle using an innovative method 
called single-molecule fluorescence resonance 
energy transfer (smFRET). In this way, they 
could obtain precise positional information 
about individual Ffh-FtsY GTPase complexes. 

The authors made several intriguing 
observations. They not only showed that the 
Ffh-FtsY GTPase complex relocates to the far 
end of the SRP RNA hairpin, but also obtained 
evidence that the complex undergoes a lengthy 
‘search process before landing at its new des- 
tination. This migration is stimulated by 
SecYEG and requires a conformational change 
within the GTPase complex that primes it for 
GTP hydrolysis. The relocation of the com- 
plex is also required for both GTP hydrolysis 
and effective delivery of the nascent protein 
to SecYEG. Taken together, these results sug- 
gest that the GTPase complex is brought to the 
brink of activation, whereupon its relocation to 
the distal end of the RNA exposes the nascent 
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protein-translocation channel. c, Shen et al.' report that SecYEG stimulates 
a dramatic relocalization of the two interacting GTPases from 

the tetraloop region of SRP RNA to an evolutionarily conserved distal 
sequence. d, Reorientation of the GTPase complex facilitates both the 
docking of the translating ribosome onto SecYEG and hydrolysis of the two 
GTP nucleotides to GDP. GTP hydrolysis stimulates the recycling of SRP 


protein's exit site on the ribosome and allows 
the ribosome to dock onto SecYEG (Fig. Ic, d). 
The relocation also stimulates GTP hydrolysis 
by the complex and SRP recycling. 

Although Shen and colleagues’ work 
provides notable mechanistic insights, it also 
raises several questions. For one, how does 
SecYEG trigger the reorientation of the Ffh- 
FtsY GTPase complex and the subsequent 
hydrolysis of GTP? Also, in the likely event 
that the signal peptide remains bound to the 
M domain of Ffh upon relocalization of the 
GTPase complex, what catalyses cargo release? 
One possibility is that the movement of the 
complex reduces the affinity of Ffh for the sig- 
nal peptide by repositioning another segment 
of Ffh that has been proposed’ to participate in 
substrate recognition. Finally, how can these 
results be reconciled with the observation” 
that the distal end of SRP RNA is not required 
for acell to survive? It is conceivable that, even 
in the absence ofa distal binding site, transient 
displacement of the GTPase complex from the 
tetraloop allows the translating ribosomes to 
be transferred to SecYEG, albeit with reduced 
efficiency. 

Perhaps the most striking implication of 
Shen and co-workers’ study is that the two 
ends of the SRP RNA hairpin have distinct 
functions at different stages of the targeting 
cycle (Fig. 1). We take it for granted that pro- 
teins can be multifunctional, but much less 
is known about the capabilities of the many 
RNAs found in nature. As the characteriza- 
tion of RNA proceeds, the message that even a 
small nucleic-acid sequence can encode con- 
siderable functional information is certainly 
worth bearing in mind. = 


Harris D. Bernstein is in the Genetics and 
Biochemistry Branch, National Institute of 
Diabetes and Digestive and Kidney Diseases, 


National Institutes of Health, Bethesda, 
Maryland 20892-0538, USA. 
e-mail: harris_bernstein@nih.gov 


1. Shen, K,, Arslan, S., Akopian, D., Ha, T. & Shan, S. O. 
Nature 492, 271-275 (2012). 

2. Bradshaw, N., Neher, S. B., Booth, D. S. & Walter, P. 
Science 323, 127-130 (2009). 

3. Shan, S., Stroud, R. M. & Walter, P. PLoS Biol. 2, 
e320 (2004). 

4. Zhang, X., Schaffitzel, C., Ban, N. & Shan, S. 


ASTROPHYSICS 


Proc. Natl Acad. Sci. USA 106, 1754-1759 (2009). 

5. Angelini, S., Deitermann, S. & Koch, H. G. EMBO 
Rep. 6, 476-481 (2005). 

6. Gu, S.-Q., Peske, F., Wieden, H.-J., Rodnina, M. V. & 
Wintermeyer, W. RNA 9, 566-573 (2003). 

7. Ataide, S. F. et al. Science 331, 881-886 (2011). 

8. Shen, K. & Shan, S. Proc. Nat! Acad. Sci. USA 107, 
7698-7703 (2010). 

9. Newitt, J. A. & Bernstein, H. D. Eur J. Biochem. 245, 
720-729 (1997). 

10.Batey, R. T., Rambo, R. P., Lucast, L., Rha, B. & 
Doudna, J. A. Science 287, 1232-1239 (2000). 


A pas de trois birth for 
wide binary stars 


The widest binary star systems pose a challenge to theory: true stellar twins 
could not form so far apart. Simulations suggest these twins are in fact triplets, 
two of which masquerade as one star and cast out the third. SEE LETTER P.221 


KEIVAN GUADALUPE STASSUN 


bout half ofall stars are found in pairs, 

presumably because they were born as 

twins. Although there is no consensus 
theory for exactly how binary stars form, it is 
gratifying that most cases satisfy a basic check 
on the twin-birth hypothesis: that binary-star 
orbits be small enough to fit within the proto- 
stellar ‘cloud cores’ in which stars form. Most 
cases, that is, but not all. Some binary stars have 
orbits comparable in size to entire star clusters. 
In this issue (page 221), Reipurth and Mikkola’ 
describe simulations showing that such ultra- 
wide binary systems may be explained if they 
are born not as twins, but rather as triplets that 
undergo an intricate dance — a dance that at 
first protects but later expels one of the three 
siblings*. 

The existence of wide binary star systems has 
been known about for a long time’, and some 
of the best examples are very familiar. Our 
nearest stellar neighbour, Proxima Centauri, 
is a gravitationally bound’ companion of our 
next-nearest, a Centauri, even though the two 
are physically separated by about 0.1 parsecs 
(about 15,000 times the Earth-Sun separation, 
or about 500 times the size of the Solar Sys- 
tem). In the past few years, large astronomical 
surveys have discovered*” thousands of such 
systems (Fig. 1), comprising a few per cent of 
all binary star systems. 

One explanation is that the stars in these 
ultra-wide systems were not born together, 
but were brought together after birth. Indeed, 
simulations have shown®* that two stars can 
be paired through a mutual, ‘soft’ gravita- 
tional capture when a cluster of hundreds of 
stars slowly disperses after formation. This 


*This article and the paper under discussion! were 
published online on 5 December 2012. 


mechanism is appealing for at least three rea- 
sons. First, it explains why the widest binary 
systems have separations comparable to the 
sizes of star clusters. Second, it explains how 
such wide binaries survive the harsh, crowded 
environments of star clusters: they do not 
have to, because they come together only 
on leaving the cluster. Third, observations 
of wide binary systems show‘ a strikingly 
bimodal distribution of separations, which 
seems to be reproduced in cluster-dispersal 
simulations. 

The simulations by Reipurth and Mikkola 
represent a fresh take on the problem. They 
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follow the dynamical gravitational interaction 
of each of 180,000 stellar triplets, from their 
birth in cloud cores up to an age of 100 million 
years, when star clusters such as the Pleiades 
complete their formation and begin to dis- 
perse. These simulations mimic the variety of 
perturbations, collisions, flybys and slingshots 
that three gravitationally interacting bodies 
will exert on one another. 

In the authors’ simulations, 90% of the 
triplets disrupt themselves — but about 10% 
persist. Seen from a distance, these survivors 
do not resemble triplets so much as wide pairs: 
two of the three stars form a tight pair. The 
simulations show that these triplets survive 
because they initially have a compact config- 
uration — necessary to fit within their natal 
clouds — that permits them to withstand the 
jostling of a thousand stellar neighbours in 
the birth cluster. But over time, the three sib- 
lings develop a rivalry that brings two of them 
tightly together at the expense of the third, 
which is cast out to an increasingly distant 
orbit about the tight pair. These ‘hierarchical’ 
triplets, the authors argue, are what observers 
see as wide binaries: the tight pair masquerades 
as a single star owing to the limited imaging 
resolution of our telescopes. 

It remains to be determined whether this 
hypothesis is consistent with all of the obser- 
vational data on ultra-wide binaries. One high- 
resolution imaging study showed’ that as many 
as 30% of such binaries were indeed triplets. 
By contrast, an analysis of the radial velocities 
of ultra-wide binary systems found only about 
10% to be triplets’’. To ascertain whether all 
apparently wide binaries are in fact triplets, 
both types of measurement must be applied to 
a large sample of systems. Furthermore, certain 


Figure 1 | Wide binary star systems. Three-colour composite images of a sample of ultra-wide binary 
star systems, out of more than 1,300 such systems identified in the Sloan Digital Sky Survey’. Each image 
is 50 arcseconds on a side, corresponding to physical separations of the binary systems of up to 1 parsec. 
The systems include ‘identical twins’ (a-d) and non-identical twins (e-j). In some cases, the systems are 
found to be triplets (I-o) or even quadruplets (k). Reipurth and Mikkola show’ that one explanation for 
ultra-wide binary systems is that they are born as triplets. 
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assumptions of Reipurth and Mikkola’s simu- 
lations will need to be checked — such as that 
the three stars in a triplet begin their lives with 
equal masses. It will be necessary to assess how 
important these assumptions are to the study’s 
main conclusions. 

Ongoing surveys should be able to test 
some of Reipurth and Mikkola’s predictions, 
including the proportion of triplets in stellar 
populations of different ages and the number 
of triplets that are dominated by the mass in 
the tight binary as opposed to the mass of the 
third star. The work also presents speculations 
that may be testable by other means, such as 
the possibility that some tight pairs within 
triplets spiral towards one another and merge, 
creating a true wide binary. Moreover, as extra- 
solar planets continue to be found in various 
types of stellar family, it would be valuable to 
have quantitative predictions for the surviva- 
bility of planets from both the triplets-as-twins 
and cluster-dispersal mechanisms. As for our 
neighbouring wide binary, Proxima Centauri 
and a Centauri, it is in fact a triplet: a Centauri 
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is a tight pair, witha planet to boot'’. m 
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Clearing up the dust 


Constructing the history of star formation over cosmic time requires an 
understanding of how starlight is absorbed by dust in galaxies. It now seems that 
there is less universality in such absorption across galaxies than expected. 


AMY BARGER 


ne of the main tools for studying how 
galaxies form and evolve is to meas- 
ure the rate at which massive stars are 
born in galaxies per unit volume of the Uni- 
verse. However, some of the light emitted by 
stars in galaxies, especially light at ultraviolet 
and visible wavelengths, can be absorbed by 
dust (grains of silicon and carbon up to a few 
hundred micrometres in size) that is present 
in the galaxies. This absorption means that the 
rate at which massive stars are forming can- 
not be measured accurately. One way to solve 
this problem is to observe galaxies at longer 
wavelengths — the infrared regime — where 
the dust grains re-radiate the light. The ultra- 
violet and infrared data can then be combined 
for each galaxy to account completely for the 
emission produced by the stars. However, 
infrared data are not usually available for large 
samples of distant galaxies. Thus, a second 
approach is to try to apply a dust correction to 
the data. Writing in Astronomy & Astrophys- 
ics, Buat et al.! have attempted to determine 
how best to do this for a large sample of distant 
galaxies, at least in a statistical way. 
Deciding what wavelength-dependent dust 
corrections to apply to galaxies is a classic 


problem. Even for nearby galaxies, it is uncer- 
tain’ * whether there is a ‘bump’ in the dust 
extinction curve at 217.5 nanometres, which 
may relate to the grain size distribution or com- 
position; an extinction curve describes how 
the dust absorption changes with wavelength. 
Extinction curves along sight lines towards 
stars in the Milky Way show a strong bump at 
this wavelength. However, the bump is weaker 
towards stars in galaxies of the neighbouring 
Large and Small Magellanic Clouds. At higher 
redshifts, there is evidence” for the presence 
of such a bump in at least some galaxies, but 
the extinction curve of a local ‘starburst’ gal- 
axy’, which does not display a bump, is usually 
adopted in correcting for dust absorption in 
distant galaxies. (Redshift is what is measured 
directly by spectroscopy. Because of the Uni- 
verse’s expansion, the light from distant sources 
has been stretched, shifting the wavelengths 
towards the red end of the spectrum. The more 
the light is shifted to the red, the farther away, 
and thus the older, the source is.) 

Fortunately, the many new observations 
obtained for the distant Universe — includ- 
ing far-infrared data from the Herschel Space 
Telescope, mid-infrared data from the Spitzer 
Space Telescope and optical-to-near-infrared 
data from ground-based telescopes — have 
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made it possible to study dust absorption at 
high redshift. Buat et al. used all of these data 
to study the dependence of dust absorption on 
a variety of global galaxy properties, such as 
observed luminosity, stellar mass and the slope 
of the spectrum in the ultraviolet, for a large 
sample of galaxies with redshifts in the range 
0.95-2.2. At these redshifts, optical observa- 
tions sample the stars’ emission of ultraviolet 
light — the wavelength regime at which mas- 
sive stars emit most of their light and suffer the 
most absorption by dust. 

Buat and colleagues used a detailed com- 
puter code’ that derives properties of galaxies 
by fitting the light emission predicted from the 
computations to observations made across the 
wavelength spectrum. They concluded that 
the inclusion of infrared data improved the 
determination of dust-absorption param- 
eters. Interestingly, they found a wide range of 
values for these parameters, which they postu- 
late could be due to the large variations in the 
extinction curves for galaxies. They infer that 
a constant extinction curve for all galaxies at 
all redshifts cannot be assumed, and that vari- 
ations should be allowed for when computing 
the amount of reprocessed ultraviolet light. 

The authors’ finding that infrared data are 
crucial for studies of the effects of absorption 
by dust in individual galaxies makes sense — 
previous work has shown how notoriously dif- 
ficult it is to predict far-infrared emission from 
observed ultraviolet emission’. The study’s 
goal was to find dependences of the dust- 
absorption parameters on global galaxy prop- 
erties, which Buat et al. hoped to use to correct 
large samples of galaxies in a statistical way. 
Although they did identify some trends, there 
is large scatter about the averages of the param- 
eters that needs to be further understood. This 
serves as a reminder that if the galaxy property 
one is trying to calculate is strongly depend- 
ent on an assumed dust extinction curve, then 
one should be cautious about the result. Stud- 
ies such as this are essential for determining 
the reliability of dust-absorption corrections. 
These corrections are pivotal in determining 
the extent to which massive star formation in 
galaxies is hidden from optical view. m 
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assumptions of Reipurth and Mikkola’s simu- 
lations will need to be checked — such as that 
the three stars in a triplet begin their lives with 
equal masses. It will be necessary to assess how 
important these assumptions are to the study’s 
main conclusions. 

Ongoing surveys should be able to test 
some of Reipurth and Mikkola’s predictions, 
including the proportion of triplets in stellar 
populations of different ages and the number 
of triplets that are dominated by the mass in 
the tight binary as opposed to the mass of the 
third star. The work also presents speculations 
that may be testable by other means, such as 
the possibility that some tight pairs within 
triplets spiral towards one another and merge, 
creating a true wide binary. Moreover, as extra- 
solar planets continue to be found in various 
types of stellar family, it would be valuable to 
have quantitative predictions for the surviva- 
bility of planets from both the triplets-as-twins 
and cluster-dispersal mechanisms. As for our 
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understanding of how starlight is absorbed by dust in galaxies. It now seems that 
there is less universality in such absorption across galaxies than expected. 
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ne of the main tools for studying how 
galaxies form and evolve is to meas- 
ure the rate at which massive stars are 
born in galaxies per unit volume of the Uni- 
verse. However, some of the light emitted by 
stars in galaxies, especially light at ultraviolet 
and visible wavelengths, can be absorbed by 
dust (grains of silicon and carbon up to a few 
hundred micrometres in size) that is present 
in the galaxies. This absorption means that the 
rate at which massive stars are forming can- 
not be measured accurately. One way to solve 
this problem is to observe galaxies at longer 
wavelengths — the infrared regime — where 
the dust grains re-radiate the light. The ultra- 
violet and infrared data can then be combined 
for each galaxy to account completely for the 
emission produced by the stars. However, 
infrared data are not usually available for large 
samples of distant galaxies. Thus, a second 
approach is to try to apply a dust correction to 
the data. Writing in Astronomy & Astrophys- 
ics, Buat et al.! have attempted to determine 
how best to do this for a large sample of distant 
galaxies, at least in a statistical way. 
Deciding what wavelength-dependent dust 
corrections to apply to galaxies is a classic 


problem. Even for nearby galaxies, it is uncer- 
tain’ * whether there is a ‘bump’ in the dust 
extinction curve at 217.5 nanometres, which 
may relate to the grain size distribution or com- 
position; an extinction curve describes how 
the dust absorption changes with wavelength. 
Extinction curves along sight lines towards 
stars in the Milky Way show a strong bump at 
this wavelength. However, the bump is weaker 
towards stars in galaxies of the neighbouring 
Large and Small Magellanic Clouds. At higher 
redshifts, there is evidence” for the presence 
of such a bump in at least some galaxies, but 
the extinction curve of a local ‘starburst’ gal- 
axy’, which does not display a bump, is usually 
adopted in correcting for dust absorption in 
distant galaxies. (Redshift is what is measured 
directly by spectroscopy. Because of the Uni- 
verse’s expansion, the light from distant sources 
has been stretched, shifting the wavelengths 
towards the red end of the spectrum. The more 
the light is shifted to the red, the farther away, 
and thus the older, the source is.) 

Fortunately, the many new observations 
obtained for the distant Universe — includ- 
ing far-infrared data from the Herschel Space 
Telescope, mid-infrared data from the Spitzer 
Space Telescope and optical-to-near-infrared 
data from ground-based telescopes — have 
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made it possible to study dust absorption at 
high redshift. Buat et al. used all of these data 
to study the dependence of dust absorption on 
a variety of global galaxy properties, such as 
observed luminosity, stellar mass and the slope 
of the spectrum in the ultraviolet, for a large 
sample of galaxies with redshifts in the range 
0.95-2.2. At these redshifts, optical observa- 
tions sample the stars’ emission of ultraviolet 
light — the wavelength regime at which mas- 
sive stars emit most of their light and suffer the 
most absorption by dust. 

Buat and colleagues used a detailed com- 
puter code’ that derives properties of galaxies 
by fitting the light emission predicted from the 
computations to observations made across the 
wavelength spectrum. They concluded that 
the inclusion of infrared data improved the 
determination of dust-absorption param- 
eters. Interestingly, they found a wide range of 
values for these parameters, which they postu- 
late could be due to the large variations in the 
extinction curves for galaxies. They infer that 
a constant extinction curve for all galaxies at 
all redshifts cannot be assumed, and that vari- 
ations should be allowed for when computing 
the amount of reprocessed ultraviolet light. 

The authors’ finding that infrared data are 
crucial for studies of the effects of absorption 
by dust in individual galaxies makes sense — 
previous work has shown how notoriously dif- 
ficult it is to predict far-infrared emission from 
observed ultraviolet emission’. The study’s 
goal was to find dependences of the dust- 
absorption parameters on global galaxy prop- 
erties, which Buat et al. hoped to use to correct 
large samples of galaxies in a statistical way. 
Although they did identify some trends, there 
is large scatter about the averages of the param- 
eters that needs to be further understood. This 
serves as a reminder that if the galaxy property 
one is trying to calculate is strongly depend- 
ent on an assumed dust extinction curve, then 
one should be cautious about the result. Stud- 
ies such as this are essential for determining 
the reliability of dust-absorption corrections. 
These corrections are pivotal in determining 
the extent to which massive star formation in 
galaxies is hidden from optical view. m 
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Conservation in the red 


The Convention on Biological Diversity has pledged to reduce species-extinction 
threats around the globe by 2020. Analysis shows that this goal is achievable but 
requires a significant increase in the current rate of investment. 


STEPHEN POLASKY 


often contain lofty goals, but gathering 

the necessary resources and commitment 
to implement such agreements has proven far 
more difficult than signing them. Over recent 
decades, numerous target dates have passed 
with little or no progress made. Writing in 
Science, McCarthy et al.’ provide a financial 
assessment of the prospects for meeting the 
targets of the international Convention on 
Biological Diversity. The good news is that 
meeting the convention's goals for 2020 is 
achievable without breaking the bank — the 
authors’ estimated costs to reduce the extinc- 
tion risk for all globally threatened species 
is only about 0.1% of global gross domestic 
product. The bad news, however, is that this 
level of spending far outstrips the amount 
currently spent. If this financing gap contin- 
ues, then these conservation targets will also 
remain unmet. 

The Convention on Biological Diversity 
(CBD) was signed at the United Nations Con- 
ference on Environment and Development 
(also known as the Earth Summit) in Rio de 
Janeiro in 1992. After a decade of little pro- 
gress, the CBD member states agreed to set a 
target of achieving a significant reduction in 
the rate of biodiversity loss by 2010. But 2010 
came and went and biodiversity loss con- 
tinued unabated. Towards the end of 2010, 
the member states agreed to a new strategic 
plan, which included targets to prevent the 
extinction of all threatened species and to 
improve their conservation status, as well as 
protect 17% of terrestrial ecosystems, by 2020. 
McCarthy and colleagues sought to assess 
what these goals mean in dollar terms, and 
how likely it is that they will be achieved from 
a financial perspective. 

The evidence base from which one can 
estimate the cost of reducing extinction risk 
is relatively poor, and the methods McCarthy 
et al. used are ‘back-of-the-envelope calcula- 
tions. The authors surveyed 236 experts in bird 
conservation, who supplied estimates of the 
cost of improving the conservation status of a 
species about which they were knowledgeable. 
The survey covered 211 globally threatened 
bird species, and McCarthy and collabora- 
tors extrapolated these per-species estimates 
to cover all 1,115 threatened bird species. 
They then determined an upper-bound 


[ete environmental agreements 


Figure 1 | Dollars for diversity. McCarthy et al.' 
have estimated that the international community 
will have to spend US$76.1 billion each year to 
protect and manage all terrestrial sites of global 
conservation significance, such as regions of the 
Mato Grosso state of Brazil, home to this collared 
anteater (Tamandua tetradactyla). 


cost estimate, based on the assumption that 
expenditure for each species is independent (in 
other words, that expenditure on one species 
would not help any other species), and a lower- 
bound estimate determined from a degree of 
shared expenditure. To further extrapolate 
these estimates to costs for non-bird species, 
the authors used data from New Zealand 
showing that birds are on average 4.2 times 
more expensive to conserve per species, and 
they factored in that there are approximately 
12 times more threatened non-bird species 
than threatened bird species. 

There is somewhat more evidence on which 
to base estimates for the costs of protecting 
terrestrial sites of biodiversity significance. 
We have global maps for ‘Important Bird 
Areas’ that are thought to contain habitat for 
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threatened, restricted-range or migratory 
birds’, and maps of areas that are already pro- 
tected’. McCarthy et al. combined these maps 
with information on management costs and 
rough estimates of land-purchase costs* to 
work out the money required to protect impor- 
tant bird habitat. They then used these values, 
together with estimates of the land needed for 
habitat to conserve non-bird threatened spe- 
cies, to derive a total amount. 

The authors’ final estimates are that 
between US$3.41 billion and $4.76 billion 
would need to be spent annually to reduce 
the threat of species extinctions, and that the 
cost of protecting and managing all terrestrial 
sites of global conservation significance would 
be $76.1 billion annually (Fig. 1). This might 
sound like a lot of money, until one consid- 
ers that the value of the global economy is 
roughly $70 trillion per year. But, according 
to the present paper, the estimates are still ten 
times larger than the amount currently being 
invested in conservation. 

The actual costs of protecting biodiversity 
could easily be higher (or lower) than these 
values. For example, the recent rise in food and 
commodity prices has increased land prices 
around the globe; in the United States, the 
average price of farmland in real terms roughly 
doubled between 1990 and 2010 (ref. 5). Fur- 
thermore, many additional threats that will 
variably affect species, including climate 
change, introduced pests and pathogens, pol- 
lution, poaching, or many of these combined, 
could make it much harder and more costly to 
conserve species. On the other hand, species 
protection will bring with it other ecosystem 
services that could significantly reduce the 
net cost of conservation. For example, some 
areas that are targeted for protection also sup- 
ply drinking water for major cities — here, 
the benefits of improved ecosystem-service 
provision alone are surely enough to justify 
conservation. 

Although the data and methods used in this 
paper are simple and subject to potentially 
significant error, they still probably get the big 
picture right. McCarthy and colleagues show 
that the costs of providing adequate protec- 
tion for biodiversity are substantial yet clearly 
affordable, but that they are much larger than 
the sums currently being devoted to conserva- 
tion. Obtaining more refined estimates might 
be desirable, but will require better data and 
more detailed methods — and more exact 
numbers are unlikely to alter either of these 
two main conclusions. 

In the end, whether or not the 2020 CBD 
targets will be met rests primarily on making 
the political and economic argument that the 
benefits of conserving biodiversity outweigh 
the costs. This point may be self-evident to 
most biologists, but it is not to many politi- 
cians. McCarthy et al. have supplied a piece 
of useful evidence on the costs of conserving 
biodiversity and have shown that they are 
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affordable. What is needed now is compelling 
evidence that conserving biodiversity is essen- 
tial for human well-being’, in the hope that this 
will convince decision-makers that conserva- 
tion is well worth the investment. m 
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Death by deacetylation 


Necrosis is associated with various diseases, yet it is arguably the least-understood 
form of programmed cell death. It emerges that a sirtuin protein regulates one 
form of necrosis through a deacetylation reaction. SEE ARTICLE P.199 


WEN ZHOU & JUNYING YUAN 


death is simply a passive process resulting 
from severe stress on a cell, due to infection 
or trauma, rather than a regulated mechanism. 
But there is increasing evidence that at least 
one form of necrosis is mediated by a cellu- 
lar program called necroptosis’. In this issue 
(page 199), Narayan et al.” add to this body of 
evidence by showing that necroptosis is regu- 
lated by the deacetylase enzyme sirtuin-2. 
Deciphering the molecular basis of this pro- 
cess is an exciting prospect, not least because 
dying cells showing characteristics of necrosis 
are found in various diseases. Mechanistic 
insight into necroptosis might therefore pro- 
vide opportunities for developing inhibitors 
that block disease-related cell death”. 
Necroptosis is activated by TNF-a, an 
immune-mediator protein that is associated 
with diseases such as rheumatoid arthritis’ and 
inflammatory bowel disease*. When this pro- 
tein interacts with its receptor TNFRI1 on the 
cell membrane, a cytoplasmic complex called 
complex I rapidly forms, binding to the recep- 
tor’s intracellular domain. Among the proteins 
that make up complex I, RIP1 is involved in 
activating the transcription factor NF-«B and 
thereby counteracting another form of pro- 
grammed cell death called apoptosis (Fig. 1a). 
RIP1 is a kinase enzyme, which modifies the 
activity of its substrates by adding phosphate 
groups to them. Its amino-terminal domain 
is essential for mediating necroptosis. If cas- 
pase-8, akey enzyme in apoptosis, is inhibited, 
RIP1 leaves complex I. It then interacts with 
RIP3 — another member of the RIP enzyme 
family — to form complex IIb. The formation 
of complex IIb is essential for necroptosis”’, 
and so the biochemical mechanisms that 


I: has long been thought that necrotic cell 


*This article and the paper under discussion? were 
published online on 28 November 2012. 


regulate it are under intense investigation. 

Narayan et al. present intriguing data that 
implicate sirtuin-2 in mediating the formation 
of complex IIb. Sirtuin-2 belongs to the sirtuin 
family of enzymes, which are deacetylases, 
removing acetyl groups from their substrates. 
Notably, deacetylases have been linked to the 
regulation of transcription, apoptosis, stress 
resistance and ageing®. While searching for 
binding partners of sirtuin-2, the authors came 
upon RIP3. They also found that, in cells lacking 
sirtuin-2 or in the presence of pharmacologi- 
cal inhibitors of this protein, TNF-a-mediated 
necroptosis is blocked. This finding suggests 
that sirtuin-2-mediated protein deacetylation 
is involved in controlling necrosis. 

Indeed, Narayan and colleagues show that, 
as well as binding to RIP3, sirtuin-2 deacety- 
lates RIP1 at a specific lysine amino-acid resi- 
due; this deacetylation seems to be required 
for the direct interaction between RIP1 and 
RIP3. Moreover, in the absence of sirtuin-2, 
the formation of complex Ib — defined as the 
interaction between RIP1 and RIP3 following 
TNF-a stimulation — does not occur (Fig. 1b). 

Consistent with the importance of RIP1 
deacetylation in necrosis, the researchers 
show that, on activation of necroptosis in cul- 
tured mouse cells, RIP1 acetylation markedly 
declines. This was also observed in a mouse 
model of a disorder known as heart ischae- 
mia reperfusion, in which the heart tissue is 
damaged when the blood supply to it is re- 
established after a period of limited supply. 
Inhibition of sirtuin-2 activity prevented 
RIP1 deacetylation and protected the heart 
from injury. Because necroptosis also medi- 
ates ischaemic brain injury’, and given that 
sirtuin-2 is abundantly present in the central 
nervous system, it is possible that this protein 
also contributes to necroptosis in neuro- 
degenerative diseases by deacetylating RIP1. 

Deacetylation of RIP1 by RIP3-bound 
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Figure 1 | Evolution of a death complex. a, When 
TNF-a binds to its receptor TNFR1, which is 

a protein that straddles the cell membrane, the 
receptor is activated and a group of proteins — 
collectively known as complex I — assembles on 
its cytoplasmic side. As part of complex I, RIP1 
mediates NF-«B activation and so promotes cell 
survival. b, RIP1 that dissociates from complex I 
is in the acetylated form, and this prevents a stable 
interaction between RIP1 and RIP3. Narayan 

et al.” show that the deacetylase enzyme sirtuin-2 
(SIRT2) interacts with RIP3 and is required 

to deacetylate RIP1. Deacetylated RIP1 forms 
another stable complex with RIP (complex IIb), 
leading to necroptosis. 


sirtuin-2 suggests that at least one reason for 
the formation of RIP1-RIP3 in complex IIb is 
to allow this deacetylation. A recent study’ sug- 
gests that RIP1-RIP3, as part of complex IIb, 
might be present in insoluble filamentous pro- 
tein clusters called amyloid structures, which 
are linked to various neurodegenerative dis- 
orders. It is possible, therefore, that protein 
deacetylation is involved in amyloid formation 
— and thus in these diseases — where it serves 
as a platform to maintain a stable RIP1-RIP3 
complex in the amyloid state. 

The enzymatic activity of sirtuin-2 depends 
on the level of its cofactor NAD*, a metabolic 
intermediate. Reduced activity of NAD” is 
associated with energy deficiency and oxi- 
dative stress in cells. This indicates that dea- 
cetylation — and, by inference, necroptosis 
— is subject to metabolic regulation. Although 
NAD* levels and sirtuin-2 activity are not 
altered during necroptosis, NAD” depletion 
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affordable. What is needed now is compelling 
evidence that conserving biodiversity is essen- 
tial for human well-being’, in the hope that this 
will convince decision-makers that conserva- 
tion is well worth the investment. m 
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Death by deacetylation 


Necrosis is associated with various diseases, yet it is arguably the least-understood 
form of programmed cell death. It emerges that a sirtuin protein regulates one 
form of necrosis through a deacetylation reaction. SEE ARTICLE P.199 


WEN ZHOU & JUNYING YUAN 


death is simply a passive process resulting 
from severe stress on a cell, due to infection 
or trauma, rather than a regulated mechanism. 
But there is increasing evidence that at least 
one form of necrosis is mediated by a cellu- 
lar program called necroptosis’. In this issue 
(page 199), Narayan et al.” add to this body of 
evidence by showing that necroptosis is regu- 
lated by the deacetylase enzyme sirtuin-2. 
Deciphering the molecular basis of this pro- 
cess is an exciting prospect, not least because 
dying cells showing characteristics of necrosis 
are found in various diseases. Mechanistic 
insight into necroptosis might therefore pro- 
vide opportunities for developing inhibitors 
that block disease-related cell death”. 
Necroptosis is activated by TNF-a, an 
immune-mediator protein that is associated 
with diseases such as rheumatoid arthritis’ and 
inflammatory bowel disease*. When this pro- 
tein interacts with its receptor TNFRI1 on the 
cell membrane, a cytoplasmic complex called 
complex I rapidly forms, binding to the recep- 
tor’s intracellular domain. Among the proteins 
that make up complex I, RIP1 is involved in 
activating the transcription factor NF-«B and 
thereby counteracting another form of pro- 
grammed cell death called apoptosis (Fig. 1a). 
RIP1 is a kinase enzyme, which modifies the 
activity of its substrates by adding phosphate 
groups to them. Its amino-terminal domain 
is essential for mediating necroptosis. If cas- 
pase-8, akey enzyme in apoptosis, is inhibited, 
RIP1 leaves complex I. It then interacts with 
RIP3 — another member of the RIP enzyme 
family — to form complex IIb. The formation 
of complex IIb is essential for necroptosis”’, 
and so the biochemical mechanisms that 


I: has long been thought that necrotic cell 


*This article and the paper under discussion? were 
published online on 28 November 2012. 


regulate it are under intense investigation. 

Narayan et al. present intriguing data that 
implicate sirtuin-2 in mediating the formation 
of complex IIb. Sirtuin-2 belongs to the sirtuin 
family of enzymes, which are deacetylases, 
removing acetyl groups from their substrates. 
Notably, deacetylases have been linked to the 
regulation of transcription, apoptosis, stress 
resistance and ageing®. While searching for 
binding partners of sirtuin-2, the authors came 
upon RIP3. They also found that, in cells lacking 
sirtuin-2 or in the presence of pharmacologi- 
cal inhibitors of this protein, TNF-a-mediated 
necroptosis is blocked. This finding suggests 
that sirtuin-2-mediated protein deacetylation 
is involved in controlling necrosis. 

Indeed, Narayan and colleagues show that, 
as well as binding to RIP3, sirtuin-2 deacety- 
lates RIP1 at a specific lysine amino-acid resi- 
due; this deacetylation seems to be required 
for the direct interaction between RIP1 and 
RIP3. Moreover, in the absence of sirtuin-2, 
the formation of complex Ib — defined as the 
interaction between RIP1 and RIP3 following 
TNF-a stimulation — does not occur (Fig. 1b). 

Consistent with the importance of RIP1 
deacetylation in necrosis, the researchers 
show that, on activation of necroptosis in cul- 
tured mouse cells, RIP1 acetylation markedly 
declines. This was also observed in a mouse 
model of a disorder known as heart ischae- 
mia reperfusion, in which the heart tissue is 
damaged when the blood supply to it is re- 
established after a period of limited supply. 
Inhibition of sirtuin-2 activity prevented 
RIP1 deacetylation and protected the heart 
from injury. Because necroptosis also medi- 
ates ischaemic brain injury’, and given that 
sirtuin-2 is abundantly present in the central 
nervous system, it is possible that this protein 
also contributes to necroptosis in neuro- 
degenerative diseases by deacetylating RIP1. 

Deacetylation of RIP1 by RIP3-bound 
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Figure 1 | Evolution of a death complex. a, When 
TNF-a binds to its receptor TNFR1, which is 

a protein that straddles the cell membrane, the 
receptor is activated and a group of proteins — 
collectively known as complex I — assembles on 
its cytoplasmic side. As part of complex I, RIP1 
mediates NF-«B activation and so promotes cell 
survival. b, RIP1 that dissociates from complex I 
is in the acetylated form, and this prevents a stable 
interaction between RIP1 and RIP3. Narayan 

et al.” show that the deacetylase enzyme sirtuin-2 
(SIRT2) interacts with RIP3 and is required 

to deacetylate RIP1. Deacetylated RIP1 forms 
another stable complex with RIP (complex IIb), 
leading to necroptosis. 


sirtuin-2 suggests that at least one reason for 
the formation of RIP1-RIP3 in complex IIb is 
to allow this deacetylation. A recent study’ sug- 
gests that RIP1-RIP3, as part of complex IIb, 
might be present in insoluble filamentous pro- 
tein clusters called amyloid structures, which 
are linked to various neurodegenerative dis- 
orders. It is possible, therefore, that protein 
deacetylation is involved in amyloid formation 
— and thus in these diseases — where it serves 
as a platform to maintain a stable RIP1-RIP3 
complex in the amyloid state. 

The enzymatic activity of sirtuin-2 depends 
on the level of its cofactor NAD*, a metabolic 
intermediate. Reduced activity of NAD” is 
associated with energy deficiency and oxi- 
dative stress in cells. This indicates that dea- 
cetylation — and, by inference, necroptosis 
— is subject to metabolic regulation. Although 
NAD* levels and sirtuin-2 activity are not 
altered during necroptosis, NAD” depletion 


occurs during the acute phase of ischaemia, 
owing to lack of oxygen. This might explain 
why there is initially no extensive cell death 
during ischaemia, but that it instead occurs 
after reperfusion, when NAD" levels are once 
again replenished. There could therefore 
be a window of opportunity for inhibiting 
ischaemic injury by blocking necroptosis 
before reperfusion occurs. 

Little is known about the composition 
of complex IIb and how it is regulated. The 
present study reveals one aspect of this complex, 
together with a new layer in the regulation of 
necroptosis. But there is still much to be learnt. 
For instance, given that acetylated RIP1 does 
not interact with RIP3, how does RIP3 bring sir- 
tuin-2 to RIP 1? It could be that RIP1 and RIP3 
form a transient interaction complex that must 
be stabilized by RIP1 deacetylation. It is equally 
likely that there are other scaffold proteins that 
bring RIP1 and RIP3 together. Also, the kinase 
activity of RIP1 and RIP3 is essential for their 
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interaction: the two proteins both phosphoryl- 
ate themselves and each other. But it is unclear 
how phosphorylation and deacetylation are 
coordinated to achieve the active state of com- 
plex IIb. Finding answers to these questions will 
bea major goal, especially given the relevance of 
necroptosis to human disease. = 
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Molecular hurdles 
cleared with ease 


Single- molecule studies reveal that a ring-like enzyme that encircles and ‘slides’ 
along one strand of duplex DNA, separating it from the other strand, overcomes 
molecular barriers in its path by transiently opening its ring. SEE ARTICLE P.205 


MICHAEL A. TRAKSELIS & BRIAN W. GRAHAM 


F« all the benefits that are gained by 


encoding genetic information in double- 

stranded DNA, the long, linear, anti- 
parallel arrangement of complementary 
strands in DNA duplexes provides challenges 
for enzymes that replicate genomes. Many 
proteins involved in replication have therefore 
adopted oligomeric, toroidal conformations 
that allow them to remain bound to, and to act 
on, DNA over long distances. One such protein 
complex is the DNA helicase, which regulates 
the initiation of replication, separates duplex 
strands and can traverse many obstacles 
in its path. 

Most organisms have a circular DNA-rep- 
lication helicase consisting of six subunits. 
The large T antigen of simian virus 40 is a 
well-characterized example, and serves as a 
model for deciphering the mechanism of DNA 
unwinding in eukaryotes (organisms that 
include plants, animals and fungi). Conflicting 
evidence has fuelled a debate about whether a 
single or a double hexamer of large T antigen 
encircles and acts on single-stranded DNA 


*This article and the paper under discussion‘ were 
published online on 28 November 2012. 


(ssDNA) or double-stranded DNA (dsDNA) 
during unwinding. On page 205 of this issue, 
Yardimci et al.' convincingly show that a dou- 
ble hexamer of large T antigen assembles at 
replication origins, and then separates into 
two single hexamers, which unwind dsDNA by 
encircling and translocating along each ssDNA 
in the 3’-to-5’ direction*. 

In addition to its hexameric unwinding 
ability, the authors describe the surprising abil- 
ity of large T antigen to bypass large objects 
that are covalently attached to the encir- 
cled translocating strand, highlighting the 
hexameric structure's remarkable plasticity. 
Although other hexameric helicases have 
been observed to have similar translocating 
and unwinding abilities in single-molecule 
studies” °, this is the first report of a helicase 
overcoming a covalent block to unwinding. 

Replication helicases face three main 
challenges. First, the enzymes must intimately 
associate with the DNA, but at the same time 
be able to slide efficiently along it over great 
(molecular) distances. Second, separation of 
the stable duplex into single strands is required 
before each of the strands can be replicated. 
And third, genomes are huge, and the strands 
of duplex DNA are bound by an array of 


13 DECEMBER 2012 | VOL 492 | NATURE | 195 


© 2012 Macmillan Publishers Limited. All rights reserved 


occurs during the acute phase of ischaemia, 
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during ischaemia, but that it instead occurs 
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be a window of opportunity for inhibiting 
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along one strand of duplex DNA, separating it from the other strand, overcomes 
molecular barriers in its path by transiently opening its ring. SEE ARTICLE P.205 
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F« all the benefits that are gained by 


encoding genetic information in double- 

stranded DNA, the long, linear, anti- 
parallel arrangement of complementary 
strands in DNA duplexes provides challenges 
for enzymes that replicate genomes. Many 
proteins involved in replication have therefore 
adopted oligomeric, toroidal conformations 
that allow them to remain bound to, and to act 
on, DNA over long distances. One such protein 
complex is the DNA helicase, which regulates 
the initiation of replication, separates duplex 
strands and can traverse many obstacles 
in its path. 

Most organisms have a circular DNA-rep- 
lication helicase consisting of six subunits. 
The large T antigen of simian virus 40 is a 
well-characterized example, and serves as a 
model for deciphering the mechanism of DNA 
unwinding in eukaryotes (organisms that 
include plants, animals and fungi). Conflicting 
evidence has fuelled a debate about whether a 
single or a double hexamer of large T antigen 
encircles and acts on single-stranded DNA 
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published online on 28 November 2012. 


(ssDNA) or double-stranded DNA (dsDNA) 
during unwinding. On page 205 of this issue, 
Yardimci et al.' convincingly show that a dou- 
ble hexamer of large T antigen assembles at 
replication origins, and then separates into 
two single hexamers, which unwind dsDNA by 
encircling and translocating along each ssDNA 
in the 3’-to-5’ direction*. 

In addition to its hexameric unwinding 
ability, the authors describe the surprising abil- 
ity of large T antigen to bypass large objects 
that are covalently attached to the encir- 
cled translocating strand, highlighting the 
hexameric structure's remarkable plasticity. 
Although other hexameric helicases have 
been observed to have similar translocating 
and unwinding abilities in single-molecule 
studies” °, this is the first report of a helicase 
overcoming a covalent block to unwinding. 

Replication helicases face three main 
challenges. First, the enzymes must intimately 
associate with the DNA, but at the same time 
be able to slide efficiently along it over great 
(molecular) distances. Second, separation of 
the stable duplex into single strands is required 
before each of the strands can be replicated. 
And third, genomes are huge, and the strands 
of duplex DNA are bound by an array of 
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different proteins along its length, which may 
interfere with replication. 

Previous structural studies®* have indi- 
cated that a double hexamer of large T antigen 
pumps dsDNA into the protein's central chan- 
nel (Fig. 1a), where it separates into two ssDNA 
sequences that are extruded out of side chan- 
nels to form rabbit-ear-like loops. This double- 
hexamer structure was predicted to remain 
intact during the entire course of unwinding. 
Large T antigen also initiates DNA replication 
by binding specifically to replication origins 
as a double hexamer. The assembly of a dou- 
ble hexamer is an intriguing initiation activity, 
but it is inconsistent with nearly every other 
unwinding mechanism identified for replica- 
tion helicases, in which single hexamers act 
through a steric exclusion mode: they encircle 
one strand while excluding the other. 

To address this inconsistency, Yardimci 
et al. performed single-molecule studies of 
large T antigen in the process of unwinding 
origin-containing dsDNA. They tethered 
dsDNA to a surface in a microfluidic flow 
chamber and added large T antigen so that it 
bound to the DNA. They then added a cell 
extract containing replication proteins, to 
initiate DNA replication, followed by a type 
of nucleotide that, when incorporated into 
replicated dsDNA, could be tagged with a 
fluorescent label. By monitoring the growth of 
newly replicated, fluorescently labelled DNA, 
Yardimci et al. calculated the fork rate (the rate 
of bidirectional replication). They observed 
that when both ends of the original DNA 
were tethered to the surface such that it was 
stretched to prevent rabbit-ear ssDNA from 
forming, the fork rate was the same as that of 
DNA tethered at just one end. This isa telling 
result — if large T antigen unwinds DNA as 
a double hexamer, then its ability to unwind 
the doubly tethered DNA should be reduced, 
because it cannot pump such dsDNA into its 
central channel. The authors’ results therefore 
show that the large T antigen double hexamer 
separates into single hexamers, which each 
unwind DNA in opposite directions from the 
origin (Fig. 1b). 

To further distinguish whether ssDNA or 
dsDNA passes through large T antigen’s cen- 
tral channel, Yardimci et al. tested the helicase’s 
ability to unwind dsDNA in which the protein 
streptavidin was non-covalently attached to 
the leading and/or lagging strands. If dsDNA 
is encircled by large T antigen, streptavidin 
would be removed ifit is on either of the single 
strands, or on both strands; but ifssDNA alone 
is encircled by the helicase, streptavidin would 
be removed only from the translocating strand. 
Consistent with a single hexamer encircling 
only one strand, the authors observed strepta- 
vidin displacement from the leading strand but 
not from the lagging strand. The researchers 
also performed single-molecule experiments 
in which dsDNA was non-covalently tagged 
with large fluorescent labels called quantum 
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Figure 1 | Mechanism of DNA unwinding by the large T antigen of simian virus 40. Hexameric DNA- 
replication helicase enzymes, such as large T antigen, bind to double-stranded DNA (dsDNA) at sites known 
as replication origins before separating the duplex into single strands. a, Yardimci et al.' report that large 

T antigen binds dsDNA as a double hexamer. b, The double hexamer then splits into two single hexamers 
that each track along a ‘leading’ strand in the 3’-to-5’ direction while excluding the other, ‘lagging; strand, 
unwinding dsDNA as they go. c, If the hexamers encounter a ‘roadblock; such as a protein covalently 
attached to the leading strand, they can bypass it efficiently without dissociating from the DNA. 


dots, instead of streptavidin. These experi- 
ments confirmed that unwinding occurs in the 
3’-to-5’ direction, with the helicase travelling 
along the leading strand. 

Surprisingly, Yardimci et al. found that 
many of the bulky quantum dots on the lead- 
ing strand were bypassed by large T antigen, 
without affecting the unwinding rate. To pro- 
vide a more natural block to unwinding, the 
authors covalently attached a protein called 
Hpall methyltransferase (M.Hpall) to either 
the leading or the lagging strand. Amazingly, 
in both cases, unwinding occurred with equal 
efficiency in spite of the covalent block. 

To test whether the helicase remains 
attached to DNA on encountering a covalently 
attached blockage, or whether a new helicase 
is recruited instead, the authors assembled 
large T antigen on DNA, washed away any 
unbound protein and then mixed the large 
T antigen-DNA assembly with replication 
protein A to initiate unwinding. Once again, 
they observed that the unwinding ability of 
large T antigen was not significantly differ- 
ent when M.Hpall was covalently attached 
to either the leading or the lagging strands, 
which suggests that a preassembled helicase 
is able to bypass the ‘roadblock (Fig. 1c). 
When the researchers attached two M.Hpall 
proteins to DNA 60 bases apart on the 
leading strand, the unwinding rate of the 
DNA by large T antigen was again similar to 
that of unadulterated DNA. However, when 
Yardimci et al. tested DNA labelled with pro- 
teins attached just 14 bases apart, unwinding 
was strongly inhibited. It will be interesting to 
determine whether the second block limits 
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large T antigen’s ability to remain bound to 
the DNA in this case, or whether the proxim- 
ity of the blocks prevents large T antigen from 
reassembling around ssDNA. 

The authors suggest that plasticity within 
the hexamer ring is responsible for the tran- 
sient opening of translocating large T antigen, 
and that this opening overcomes natural bar- 
riers to unwinding. They also argue that mul- 
tiple blocks located close together prevent the 
reclosing of large T antigen, or stimulate the 
protein’s dissociation from DNA. It is known 
that the asymmetrical hexameric arrange- 
ment of subunits in the eukaryotic replica- 
tion helicase MCM2-7 — which is thought to 
have an analogous structure and unwinding 
mode to large T antigen — results in a labile 
interface between this helicase’s MCM2 and 
MCM5 subunits’. This is thought to facili- 
tate the opening and loading of MCM2-7 at 
replication origins, but might also allow the 
helicase to open transiently, to bypass protein 
roadblocks as it unwinds DNA. 

It is now important to reconcile exactly how 
large T antigen bypasses impediments. For 
example, how are the molecular interactions 
with DNA in the central channel of the anti- 
gen disrupted by a roadblock? If the hexamer 
opens transiently, how does it remain associ- 
ated with the DNA? Does the degree of open- 
ing depend on the size of the roadblock, and 
what is the mechanism for reclosure around 
ssDNA afterwards? Perhaps, after opening of 
a hexamer, large T antigen jumps or rotates 
over a roadblock before reassociating with 
the DNA, akin to a molecular hurdler. Or per- 
haps interactions with the lagging strand on 


the external surface of the large T antigen are 
required to keep it from fully dissociating from 
DNA, similarly to what has been observed for 
an archaeal hexameric helicase’®. 

Large T antigen, despite having served as 
a model for studies of the unwinding mecha- 
nisms of hexameric helicases for years, con- 
tinues to surprise scientists with its abilities. 
It will be interesting to see whether other 
hexameric helicases share these abilities, or if 
there are specific differences between them. 
In the meantime, Yardimci et al. have greatly 
advanced our understanding of how helicases 
unwind long stretches of DNA in the presence 
of competing cellular processes. m 
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Molecules that convert 
heat into light 


Aclass of fluorescent organic molecule has been designed that enables highly 
efficient light-emitting diodes to be made. The devices may turn out to be 
competitors to their conventional analogues. SEE LETTER P.234 


BRIAN D’ANDRADE 


not least because they are the means by 

which people interact with the Internet. 
The mobile display market has seen tremen- 
dous growth’, for example, with more than 
1 billion mobile phones being sold in 2006. 
Liquid-crystal displays are a dominant tech- 
nology for display applications, but organic 
light-emitting diodes (OLEDs) have steadily 
been incorporated into mobile devices and 
are increasingly being used in televisions. On 
page 234 of this issue, Uoyama et al.’ report a 
type of OLED that may offer an alternative to 
its conventional equivalents. 

One of the unique features ofan OLED dis- 
play is that every pixel has an OLED, which 
is essentially a micrometre-sized light source. 
The millions of red, green and blue OLEDs ina 
display all have to be efficient, manufacturable 
with good yields and capable of providing a 
highly consistent light output. These require- 
ments have been achieved by improving the 
organic materials used in OLEDs. 

One of the key drivers of OLED technology 
is novel chemistry. New molecules constantly 
offer surprises that challenge conventional 
wisdom and offer alternative solutions. Tang 
and colleagues’ introduced the world to mod- 
ern OLED designs 25 years ago. However, the 


lectronic displays are almost ubiquitous, 


devices were poor at converting input electric 
current to light because they incorporated 
inefficient fluorescent organic materials. Then, 
in 1998, a team led by Forrest (a physicist) and 
Thompson (a chemist) introduced* devices 
with phosphorescent molecules that enabled 
almost 100% efficient conversion. Phospho- 
rescent molecules differ from their fluorescent 
analogues in that they contain a heavy metal 
atom such as iridium. 

This work led to an explosion of phospho- 
rescent materials that enabled many desirable 
characteristics, such as a variety of colours, 
long operational lifetimes and high efficien- 
cies. However, it remains difficult to produce 
a single display that uses blue, green and red 
phosphorescent devices, because the opera- 
tional lifetimes and colour characteristics of 
some phosphorescent devices do not meet the 
strict performance requirements for display 
applications. It is important to solve these 
problems because phosphorescent materials 
are the most promising materials for displays, 
with the potential to deliver nearly 100% effi- 
ciency. Now, Uoyama et al. describe how they 
have made efficient OLEDs that use a special 
type of fluorescent material. 

When an organic molecule absorbs electri- 
cal energy, it moves from its original state to 
a higher-energy (excited) state. For high-effi- 
ciency devices, the excited molecule preferably 
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sheds the extra energy by producing a photon 
of light when it returns to its original, unex- 
cited state. The molecular transition from 
excited to unexcited states has a few strict 
rules related to quantum-mechanical condi- 
tions that must be satisfied. These rules pre- 
vent every excited molecule from emitting a 
photon and returning to an unexcited state. As 
a result, fluorescent materials used in OLEDs 
often generate heat instead of light. Phospho- 
rescent materials break those rules by using a 
heavy metal atom, such as iridium, bonded to 
the organic materials (typically comprising 
carbon, nitrogen and hydrogen) in an individ- 
ual molecule. This makes it significantly more 
likely that after absorbing electrical energy, a 
phosphorescent molecule can efficiently tran- 
sition from a high energy state to a low one by 
emitting a photon. 

Uoyama and colleagues developed fluo- 
rescent molecules that can achieve high effi- 
ciency by using heat instead of a heavy metal 
atom. They convincingly demonstrated the 
molecules’ potential by making highly effi- 
cient fluorescent OLEDs with a performance 
comparable to that of phosphorescent OLEDs. 
Their fluorescent molecules are based on 
carbazolyl dicyanobenzene and as such have 
the potential to be synthesized at low cost, 
Uoyama et al. believe, because the synthetic 
process does not require rare metal catalysts 
or rare heavy metal atoms. Furthermore, the 
OLEDs can emit multiple colours, from sky- 
blue (with a peak wavelength at 473 nanome- 
tres) to orange (577 nm). 

There are many challenges ahead before 
these molecules can be fully accepted into 
production. The emissive colours have to 
be developed for display applications; the 
operational lifetime of devices that incorpo- 
rate the molecules needs to be comparable 
to, or preferably better than, state-of-the-art 
devices (possibly the hardest challenge); and 
the ability to manufacture products based on 
the new OLEDs must be proven. The mol- 
ecules are not certain to replace existing ones, 
but they may be shaped into contenders in 
terms of the cost and quality of desirable per- 
formance characteristics. Indeed, perfecting 
the new molecules will take time and effort, 
but Chihaya Adachi, who leads the work and 
is aco-author of the study’, has the experience 
needed to help his team to overcome these 
obstacles. m 
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the external surface of the large T antigen are 
required to keep it from fully dissociating from 
DNA, similarly to what has been observed for 
an archaeal hexameric helicase’®. 

Large T antigen, despite having served as 
a model for studies of the unwinding mecha- 
nisms of hexameric helicases for years, con- 
tinues to surprise scientists with its abilities. 
It will be interesting to see whether other 
hexameric helicases share these abilities, or if 
there are specific differences between them. 
In the meantime, Yardimci et al. have greatly 
advanced our understanding of how helicases 
unwind long stretches of DNA in the presence 
of competing cellular processes. m 
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light-emitting diodes (OLEDs) have steadily 
been incorporated into mobile devices and 
are increasingly being used in televisions. On 
page 234 of this issue, Uoyama et al.’ report a 
type of OLED that may offer an alternative to 
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rescent molecules differ from their fluorescent 
analogues in that they contain a heavy metal 
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rescent materials that enabled many desirable 
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tional lifetimes and colour characteristics of 
some phosphorescent devices do not meet the 
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applications. It is important to solve these 
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have made efficient OLEDs that use a special 
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sheds the extra energy by producing a photon 
of light when it returns to its original, unex- 
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excited to unexcited states has a few strict 
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tions that must be satisfied. These rules pre- 
vent every excited molecule from emitting a 
photon and returning to an unexcited state. As 
a result, fluorescent materials used in OLEDs 
often generate heat instead of light. Phospho- 
rescent materials break those rules by using a 
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the organic materials (typically comprising 
carbon, nitrogen and hydrogen) in an individ- 
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likely that after absorbing electrical energy, a 
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sition from a high energy state to a low one by 
emitting a photon. 
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rescent molecules that can achieve high effi- 
ciency by using heat instead of a heavy metal 
atom. They convincingly demonstrated the 
molecules’ potential by making highly effi- 
cient fluorescent OLEDs with a performance 
comparable to that of phosphorescent OLEDs. 
Their fluorescent molecules are based on 
carbazolyl dicyanobenzene and as such have 
the potential to be synthesized at low cost, 
Uoyama et al. believe, because the synthetic 
process does not require rare metal catalysts 
or rare heavy metal atoms. Furthermore, the 
OLEDs can emit multiple colours, from sky- 
blue (with a peak wavelength at 473 nanome- 
tres) to orange (577 nm). 

There are many challenges ahead before 
these molecules can be fully accepted into 
production. The emissive colours have to 
be developed for display applications; the 
operational lifetime of devices that incorpo- 
rate the molecules needs to be comparable 
to, or preferably better than, state-of-the-art 
devices (possibly the hardest challenge); and 
the ability to manufacture products based on 
the new OLEDs must be proven. The mol- 
ecules are not certain to replace existing ones, 
but they may be shaped into contenders in 
terms of the cost and quality of desirable per- 
formance characteristics. Indeed, perfecting 
the new molecules will take time and effort, 
but Chihaya Adachi, who leads the work and 
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needed to help his team to overcome these 
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The NAD-dependent deacetylase SIRT2 is 


required for programmed necrosis 


Nisha Narayan'*, In Hye Lee’, Ronen Borenstein!, Junhui Sun’, Renee Wong”, Guang Tong’, Maria M. Fergusson’, Jie Liu’, 
Ilsa I. Rovira’, Hwei-Ling Cheng*, Guanghui Wang®, Marjan Gucek®, David Lombard®, Fredrick W. Alt*, Michael N. Sack’, 
Elizabeth Murphy’, Liu Cao’ & Toren Finkel! 


Although initially viewed as unregulated, increasing evidence suggests that cellular necrosis often proceeds through a 
specific molecular program. In particular, death ligands such as tumour necrosis factor (TNF)-« activate necrosis by 
stimulating the formation of a complex containing receptor-interacting protein 1 (RIP1) and receptor-interacting 
protein 3 (RIP3). Relatively little is known regarding how this complex formation is regulated. Here, we show that 
the NAD-dependent deacetylase SIRT2 binds constitutively to RIP3 and that deletion or knockdown of SIRT2 prevents 
formation of the RIP1-RIP3 complex in mice. Furthermore, genetic or pharmacological inhibition of SIRT2 blocks 
cellular necrosis induced by TNF-a. We further demonstrate that RIP1 is a critical target of SIRT2-dependent 
deacetylation. Using gain- and loss-of-function mutants, we demonstrate that acetylation of RIP1 lysine 530 
modulates RIP1-RIP3 complex formation and TNF-a-stimulated necrosis. In the setting of ischaemia-reperfusion 
injury, RIP1 is deacetylated in a SIRT2-dependent fashion. Furthermore, the hearts of Sirt2~'~ mice, or wild-type 
mice treated with a specific pharmacological inhibitor of SIRT2, show marked protection from ischaemic injury. 
Taken together, these results implicate SIRT2 as an important regulator of programmed necrosis and indicate that 
inhibitors of this deacetylase may constitute a novel approach to protect against necrotic injuries, including ischaemic 


stroke and myocardial infarction. 


Several forms of cell death exist, each showing distinctive morpho- 
logical features. The apoptotic program is an energy-dependent method 
of cell death that results in cytoplasmic shrinking, nuclear condensa- 
tion, caspase activation and the ultimate fragmentation of the cell. 
Necrosis, on the other hand, is thought to occur in an energy-depleted 
setting and involves loss of membrane integrity, subsequent swelling of 
the cell and eventual cellular lysis. Although initially viewed as a passive 
or default pathway, accumulating evidence suggests that at least some 
forms of necrosis are programmed and regulated’ ’. This programmed 
necrosis, also termed necroptosis, may have important implications in 
the cellular response to a host of insults, including bacterial and viral 
infection, various neurodegenerative processes, as well as ischaemia- 
reperfusion injury of the brain and heart’. 

Ligands such as TNF-a, FASL and TRAIL can induce both apop- 
totic and necrotic cell death. For instance, the addition of TNF-« 
results in the activation of the serine/threonine kinase RIP1 that is 
in turn important for the ligand-stimulated activation of NF-«B, the 
execution of apoptosis through its interaction with FADD, as well as 
the induction of necrosis via its complex formation with the RIP3 
kinase’**°, The interaction of RIP1 and RIP3 seems to be required for 
necrosis and occurs through a homotypic interaction motif known as 
the RHIM domain*’. Necrostatin-1, a pharmacological inhibitor of 
RIP1 kinase, seems to inhibit RIP1-RIP3 interaction as well as inhi- 
biting necrotic cell death’. Nonetheless, the precise molecular mecha- 
nisms that regulate the interaction of RIP] and RIP3 are poorly 
understood. Here, we demonstrate an obligate role for the NAD- 
dependent deacetylase SIRT2. 


SIRT2 binds to RIP3 


Of the seven known mammalian sirtuin isoforms, relatively little is 
known about the predominantly cytosolic family member SIRT2*. In 
an effort to more fully understand the role of SIRT2, we transfected 
cells with a Flag-tagged version of SIRT2 to identify interacting pro- 
teins. Using a threshold of two or more independent peptide frag- 
ments, we identified only a handful of proteins that were specifically 
immunoprecipitated when the Flag peptide eluant was analysed by 
mass spectroscopy (Supplementary Fig. 1). Besides SIRT2 itself, this 
analysis identified heat shock protein 1A as well as B-tubulin, the 
heterodimeric partner of one of the few known targets of SIRT2- 
dependent deacetylation’. The final SIRT2 candidate interacting pro- 
tein using this approach was RIP3. On the basis of the novelty and 
potential biological importance, we sought to pursue the putative 
interaction of SIRT2 and RIP3. 

We first confirmed this protein interaction using epitope-tagged SIRT2. 
Using this approach, we were able to identify co-immunoprecipitation 
of SIRT2 and RIP3 (Fig. 1a). In contrast, SIRT2 did not seem to interact 
with RIP1 (Supplementary Fig. 2). The interaction between SIRT2 and 
RIP3 was not altered when cells were stimulated to undergo programmed 
necrosis using the combination of TNF-a along with the caspase inhibitor 
z-VAD-fmk. We were able to observe a similar interaction with and with- 
out necrotic stimulation between endogenous SIRT2 and endogenous 
RIP3 (Fig. 1b). Using *°S-labelled SIRT2 and either full-length glu- 
tathione-S-transferase (GST)-tagged GST-RIP3 or various GST-RIP3 
deletion mutants, we could demonstrate that SIRT2 bound the carboxy 
terminus of RIP3 in vitro (Fig. Ic). 
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Figure 1 | SIRT2 interacts with RIP3 and regulates RIP1-RIP3 complex 
formation. a, b, In L929 cells epitope-tagged SIRT2 co-immunoprecipitates 
with endogenous RIP3 (a), as does endogenous SIRT2 and RIP3 (b). IP, 
immunoprecipitation; WB, western blot. This interaction was unchanged by 
the addition of the necrotic stimulus TNF-« and z-VAD-fmk (T/z). 


Given the interaction of SIRT2 with RIP3, we next asked whether 
SIRT2 might modulate the RIP1-RIP3 interaction. Mouse embryonic 
fibroblasts (MEFs) obtained from wild-type or Sirt2-‘~ mice (Sup- 
plementary Fig. 3) were analysed in the presence or absence of the 
combination of TNF-a, cycloheximide and z-VAD-fmk to induce 
necrosis. As expected, in wild-type MEFs, the addition of this necrotic 
stimulus resulted in RIP1-RIP3 complex formation (Fig. 1d). In con- 
trast, in Sirt2—'~ MEFs formation of this necroptosis complex was not 
observed. 
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c, Schematic representation of the functional domains of RIP3 and 
corresponding GST fusion constructs used to assess in vitro binding of 
[?°S]SIRT2. Amino acids 440-518 seem to be required for SIRT2 binding. 

d, Wild-type and Sirt2'~ MEFs were stimulated for 2h with the combination 
of TNF-a, z-VAD-fmk and cycloheximide (T/z/c) to induce necrosis. 


SIRT2 is required for necroptosis 


The absence of a RIP1-RIP3 complex indicated that SIRT2 might 
modulate the overall necrotic response. The mouse L929 cell line is 
often analysed in this context because it undergoes robust necrosis in 
response to the combination of TNF-« and z-VAD-fmk*®. To assess 
the role of SIRT2 in programmed necrosis, we knocked down the 
expression of SIRT2 in L929 cells using two separate short hairpin 
RNAs (shRNAs; Fig. 2a). As we observed in Sirt2-'~ MEEs, knock- 
down of SIRT2 in L929 cells blocked RIP1-RIP3 complex formation 
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Figure 2 | Inhibition of SIRT2 prevents cellular necrosis. a, Expression of 
SIRT2 in L929 cells in uninfected cells (Uninf), cells infected with a control 
lentivirus (Ctl inf), or cells infected with one of two independent shRNAs 
targeting Sirt2 (shSirt2; and shSirt2)). b-e, Representative FACS analysis of 
annexin V and propidium iodide (PI) staining initially (b, d) or 5h after 
stimulation with TNF-« and z-VAD-fmk (T/z) (c, e). f, Level of PI-positive 
necrotic cells in the presence or absence of TNF-« and z- VAD-fmk for control 
cells or cells with shRNA-mediated knockdown of SIRT2 (mean + s.d., n = 4 
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independent experiments). g, Necrosis in the presence or absence of TNF-a, 
z-VAD-fmk and cycloheximide (T/z/c) in wild-type MEFs (+/+) or Sirt2-/~ 
MEFs (mean = s.d., n = 3 independent experiments). h, i, Treatment with the 
SIRT2 inhibitor AGK2 blocks RIP1-RIP3 complex formation (h) and 
TNF-o-induced necrosis (i) (mean + s.d., n = 3 independent experiments). 
**P < 0.01 by ANOVA followed by Tukey-Kramer multiple comparison for 
significance. 
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in the setting of TNF-a stimulation (Supplementary Fig. 4). In control 
knockdown cells, 5h after the addition of TNF-« and z-VAD-fmk, a 
significant fraction of the cells appeared necrotic as evident by show- 
ing positive staining for both annexin V and propidium iodide (Fig. 2b, c). 
In contrast, in cells with stable SIRT2 knockdown, there was a lack of 
positive propidium iodide staining, indicating a defect in programmed 
necrosis (Fig. 2d, e). This lack of necrosis was evident in both stable cell 
lines containing SIRT2 knockdown (Fig. 2f), as it was in Sirt2-'~ MEFs 
treated with TNF-«, cycloheximide and z-VAD-fmk (Fig. 2g). Finally, we 
took advantage of the availability of a previously developed pharmaco- 
logical inhibitor of SIRT2 deacetylase activity'®''. Treatment of L929 
cells with the specific SIRT2 deacetylase inhibitor AGK2 inhibited the 
formation of the RIP1-RIP3 complex (Fig. 2h). This treatment also 
effectively inhibited programmed necrosis in these cells (Fig. 2i). 


SIRT2 regulates RIP1 acetylation 


Taken together, our results demonstrate a requirement for SIRT2 in the 
formation of the RIP1-RIP3 complex and subsequent programmed 
necrosis that is observed following stimulation with TNF-a. Given that 
AGK2 inhibits SIRT2 deacetylase activity'®, the observation that this 
compound inhibits the RIP1-RIP3 interaction suggests that SIRT2- 
dependent deacetylation of RIP3 or RIP1 might regulate complex 
formation. Our analysis revealed that, although RIP3 was acetylated, 
the level of endogenous RIP3 acetylation was not altered following 
necrotic stimulation or after knockdown of SIRT2 (Supplementary 
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Fig. 5). In contrast, transfection of RIP1 into the L929 cell line demon- 
strated that RIP1 was acetylated under basal conditions and that the 
level of RIP1 acetylation markedly declined after stimulation with TNF- 
a and z-VAD-fmk (Fig. 3a and Supplementary Fig. 6). This decline was 
not observed in L929 cells with stable SIRT2 knockdown (Fig. 3a). A 
similar TNF-c-induced, SIRT2-dependent deacetylation was also 
observed with endogenous RIP1 (Supplementary Fig. 7). 

These results indicate that RIP1 may bea target for SIRT2-dependent 
deacetylation. Consistent with this hypothesis, immunopurified RIP1 
could be effectively deacetylated in vitro by the addition of purified wild- 
type SIRT2, whereas the addition of a catalytically inactive mutant of 
SIRT2 (Q130A) had no effect (Fig. 3b). Furthermore, addition of the 
specific SIRT2 inhibitor AGK2, but not the HDAC6 inhibitor tubacin, 
blocked RIP1 deacetylation under necrotic conditions (Fig. 3c). We 
next sought to gain further insight into the relationship between nec- 
rosis and RIP1 deacetylation. Our results indicate that the addition of 
TNF-a and z-VAD-fmk stimulates the SIRT2-dependent deacetylation 
of RIP1. The two most likely explanations for these observations is 
either that necrosis directly stimulates SIRT2 activity, or that necrosis 
brings RIP1 and the RIP3-SIRT2 complex in close proximity, thus 
facilitating deacetylation of RIP1. When cells were stimulated to undergo 
necrosis, we observed no significant increase in SIRT2 activity, nor were 
there measurable changes in cellular NAD levels (Supplementary Fig. 8). 
In contrast, knockdown of RIP3 inhibited RIP1 deacetylation under ne- 
crotic conditions (Fig. 3d and Supplementary Fig. 9). Similarly, addition 
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Figure 3 | RIP1 is a target of SIRT2-dependent deacetylation. a, Levels of 
acetylation for epitope-tagged RIP1 in either control L929 cells or L929 cells 
with stable knockdown of SIRT2 expression. Acetylation was determined in the 
presence or absence of TNF-« and z-VAD-fmk (T/z) treatment and assessed 
using an acetyl-lysine (AcK)-specific antibody. HA, haemagglutinin. b, RIP1 
can be deacetylated in vitro by wild-type SIRT2, but not by a deacetylase- 
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inactive SIRT2 mutant (SIRT2(Q130A)). c, Under necrotic conditions, 
deacetylation of RIP1 is blocked by the SIRT2 inhibitor AGK2, but not by the 
HDAC6 inhibitor tubacin. d, In L929 cells, knockdown of RIP3 inhibits RIP1 
deacetylation following the induction of necrosis. e, Treatment of L929 cells with 
necrostatin-1 inhibits the necrosis-induced deacetylation of RIP1. 
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of necrostatin-1, a small molecule that is thought to lock RIP1 in an 
inactive configuration and block RIP1-RIP3 interaction”, also inhibited 
RIP1 deacetylation (Fig. 3e). Thus, our data are most consistent with 
TNF-o and z-VAD-fmk bringing RIP1 and the RIP3-SIRT2 complex in 
close proximity to each other, thereby facilitating SIRT2-dependent 
deacetylation of RIP1. 

We next sought to identify the lysine residues of RIP1 that might be 
critical targets for SIRT2-dependent deacetylation. Unfortunately, we 
found it difficult to map in vivo acetylation sites, most probably because 
RIP1 is both toxic and rapidly degraded when overexpressed’*'®. We 
did however note that in vivo expression of the p300 acetyltransferase 
markedly increased the level of basal RIP1 acetylation (Supplementary 
Fig. 10). We reasoned that lysine acetylation of RIP1 near the RHIM 
domain of the protein would provide an attractive model to explain the 
SIRT2-dependent nature of the RIP1-RIP3 interaction. The RHIM 
domain of human RIP1 encompasses amino acid residues 531-547 
(ref. 17). Interestingly, there is a lysine residue immediately adjacent 
to the RHIM domain (lysine 530) that is predicted to be a site for 
internal acetylation using a widely available bioinformatic approach 
(Supplementary Fig. 11; score of 1.17 by the PAIL algorithm on high 
stringency, predictive power >89%). Using mass spectroscopy, we 
confirmed that this lysine residue could be acetylated by p300 
(Supplementary Fig. 11). Moreover, recombinant SIRT2 efficiently 
deacetylated lysine 530 when a mixture of acetylated and deacetylated 
peptides encompassing amino acids 529 to 546 of RIP1 was used as a 
substrate (Fig. 4a). 

We next sought to obtain further evidence that SIRT2-dependent 
deacetylation regulates RIP1. In cells, expression of wild-type SIRT2 
resulted in the deacetylation of co-expressed wild-type RIP1 (Fig. 4b). 
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Figure 4 | Acetylation status of RIP1 lysine 530 regulates necrosis. 

a, Recombinant SIRT2 can efficiently deacetylate a peptide substrate 
encompassing amino acids 529 to 546 of RIP1. The substrate contains a mixture 
of peptides with either acetylated or non-acetylated lysine 530. b, Lysine 
acetylation of wild-type RIP1 and the site directed mutant RIP1(K530Q) in 
1929 cells following co-transfection of either wild-type SIRT2 or a deacetylase 
inactive mutant (Q130A). ¢, Comparison of complex formation for wild-type 
RIP1, a loss of function mutant (RIP1(K530A) mutant) or for a gain of function 
mutant (RIP1(K530Q)). d, Analysis of various RIP1 constructs reconstituted in 
a Jurkat T cell line that lacks RIP1 expression. Stable cell lines were generated 
and the level of necrosis was assessed 14h after TNF-o stimulation. Data were 
analysed by an ANOVA followed by the Tukey-Kramer multiple comparison 
method to determine significance (n = 3, mean = s.d.). 
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This was not observed when a deacetylase-deficient form of SIRT2 
(Q130A) was expressed. Similarly, this deacetylation seemed to be 
specific, as expression of a related sirtuin family member, SIRT1, 
did not alter the level of wild-type RIP1 acetylation (Supplementary 
Fig. 12). In contrast, a site-directed mutant of RIP1 at position 530 
that mimics constitutive acetylation (RIP1(K530Q)) was insensitive 
to further SIRT2-dependent deacetylation (Fig. 4b). Similarly, 
whereas as previously observed wild-type RIP1 underwent marked 
deacetylation following TNF-« and z-VAD-fmk treatment, the acet- 
ylation of the RIP1(K530Q) mutant was insensitive to ligand addition 
(Supplementary Fig. 13). 

These results are consistent with lysine 530 being a critical target of 
SIRT2 and that, following deacetylation of this residue, additional RIP1 
lysine residues can be deacetylated by SIRT2 in a processive fashion. To 
further pursue the role of lysine 530, we analysed site-directed mutants of 
RIP1 that represent potential gain (RIP1(K530Q)) or loss (RIP1(K530A)) 
of acetylation function. We then evaluated the ability of these various 
RIP1 mutants to form a complex with RIP3. As expected, using wild-type 
RIP1, a RIP1-RIP3 complex was induced after stimulation with TNF-« 
and z-VAD-fmk (Fig. 4c). In contrast, the RIP1(K530A) mutant seemed 
to form a complex with RIP3 independent of ligand stimulation, whereas 
the RIP1(K530Q) mutant failed to interact with RIP3 either under basal 
conditions, or under conditions that stimulate cellular necrosis (Fig. 4c). 
To further confirm the specific functional importance of lysine 530, we 
used a previously described Jurkat T cell line that had been engineered to 
lack RIP1 expression and fails to undergo programmed necrosis with 
TNF-a and z-VAD-fmk unless RIP1 is reconstituted'*””. Using this 
RIP1-deficient cell line, we created stable cell lines reconstituted with 
wild-type RIP1, the various lysine 530 RIP1 mutants, or an empty vector. 
As expected, expression of wild-type RIP1 restored a necrotic response to 
cells that were stimulated with TNF-« and z-VAD-fmk (Fig. 4d). Com- 
pared to wild-type RIP1, the level of observed necrosis was higher in cells 
expressing RIP1(K530A), whereas cells expressing the RIP1(K530Q) 
mutant were unable to undergo TNF-«-induced necrosis. Together, these 
data support the notion that SIRT2-dependent deacetylation of lysine 530 
of RIP1 is required for stable RIP1-RIP3 complex formation and for 
ligand-dependent programmed necrosis. 


Inhibiting SIRT2 reduces ischaemic injury 


We next sought to further assess the physiological implications of our 
observations. Previous data have indicated that inhibition of necrosis 
by agents such as necrostatin-1 can reduce ischaemia-reperfusion 
injuries in critical organs such as the brain and heart’’°. We reasoned 
that genetic or pharmacological inhibition of SIRT2 might provide 
similar protection. We subjected the hearts of age- and sex-matched 
littermates of wild-type or Sirt2~'~ mice to a protocol of ischaemia- 
reperfusion (Fig. 5a). At the end of reperfusion, the hearts of Sirt2/~ 
mice showed improved function as assessed by their rate pressure 
product (Fig. 5b). This functional improvement was accompanied 
by an approximate 50% reduction in the size of myocardial infarcts 
when Sirt2-‘~ mice were compared to their wild-type littermates 
(Fig. 5c, d; wild-type mice infarct size 32.3 + 4.7% of myocardium, 
Sirt2-/~ mice 19.0+ 8.8%, mean+s.d, n=11 per genotype, 
P<0.001). As we observed following TNF-« and z-VAD-fmk stimu- 
lation, ischaemia-reperfusion also resulted in the in vivo deacetylation 
of RIP1 (Fig. 5e). This deacetylation was reduced in Sirt2-‘~ mice, or 
in wild-type mice treated with the SIRT2 inhibitor AGK2. Similarly, as 
observed in other cases of stimulated necrosis, ischaemia-reperfusion 
resulted in an observed increase in RIP1-RIP3 interaction that was also 
not evident in either Sirt2-‘~ mice or in AGK2-treated wild-type 
animals (Supplementary Fig. 14). Consistent with these biochemical 
data, treatment of wild-type mice with AGK2 produced a similar im- 
provement in functional recovery after ischaemia-reperfusion (Fig. 5f) 
and corresponding reduction in ultimate infarct size (Fig. 5g; wild-type 
vehicle 29.4 + 2.4% of myocardium and AGK2-treated 16.0 + 0.4%, 
mean + s.d.,n =5 per group, P< 0.001). When we further assessed 
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Figure 5 | Inhibition of SIRT2 protects against ischaemia-reperfusion 
injury. a, Protocol for myocardial injury involving 20 min of global ischaemia 
followed by a 2-h reperfusion period. b, Rate pressure product (RPP = heart 
rate X left ventricular developed pressure), a measurement of cardiac function 
demonstrating preserved function in Sirt2-‘~ mice, compared to wild-type 
(WT) littermates (n = 11 mice per group; 6 females and 5 males per group). 
c, Representative histological sections stained with 2,3,5-triphenyltetrazolium 
chloride (TTC) as a marker of necrotic damage demonstrating less 

apparent infarct area (pale region/TTC positive) in the hearts of Sirt2 ‘~ mice. 
d, Quantification of infarct size in WT and Sirt2-'~ mice (n = 11 per group; 


cardiac ischaemia-reperfusion injury using additional in vivo models, 
we noted a similar decrease in infarct size in Sirt2-‘~ mice (Sup- 
plementary Fig. 15). 


Discussion 


Our results establish a role for SIRT2 in regulating necroptosis. In par- 
ticular, we show that SIRT2-dependent deacetylation of RIP1 serves as 
an important regulatory mechanism for programmed necrosis. Al- 
though no formal structure exists, on the basis of work with related 
kinases, RIP1 is thought to spontaneously interconvert between two 
separate conformations, with TNF-o stimulation stabilizing the active 
state’*. We propose that necrotic stimulation brings this active con- 
formation of RIP1 into close proximity with the pre-formed RIP3- 
SIRT2 complex, allowing for lysine 530 deacetylation with subsequent 
stable complex formation (Fig. 5g). This model is supported by our 
data using necrostatin-1 that indicate that, in the setting of TNF-« 
stimulation, RIP1 most likely needs to be in an active conformation 
to undergo deacetylation (Fig. 3e and Supplementary Fig. 16). Simi- 
larly, because RIP3 knockdown inhibits RIP1 deacetylation with- 
out altering overall SIRT2 levels (Fig. 3d), our data indicate that, 
under physiological conditions, RIP1 deacetylation occurs in a RIP3- 
dependent fashion. Finally, our results with the RIP1 gain and loss of 
function mutants, including the RIP1(K530A) mutant that undergoes 
ligand-independent interaction, indicate that RIP1 deacetylation pre- 
cedes RIPI-RIP3 complex formation. Interestingly, although the 
RIP1(K530A) mutant formed a constitutive complex with RIP1 these 
cells did not undergo spontaneous necrosis, suggesting that RIP1-RIP3 
complex formation is not by itself sufficient for necroptosis. It will be 
important to further test the in vivo correlates of these observations using 
the appropriate RIP1 K530 knock-in mouse models. 


6 females and 5 males per group). e, Levels of RIP1 acetylation in hearts under 
basal conditions or after ischaemia/reperfusion (I/R). RIP1 acetylation 
decreased in WT heart after I/R. Acetylation of RIP1 was maintained during I/R 
in wild-type AGK2 treated animals or Sirt2~‘~ hearts. Shown is one of two 
similar experiments. f, g, RPP (f) and infarct size (g) in WT mice treated with 
vehicle alone or with 15 uM AGK2, a specific SIRT2 inhibitor. n = 5 per group; 
3 males and 2 females with all data expressed as mean + s.d.; *P < 0.05, 

**P < 0.01 by unpaired Student t-test. h, A model for SIRT2-dependent 
regulation of programmed necrosis. 


SIRT2 has been previously implicated as a tubulin deacetylase’, a 
potential regulator of cell cycle progression”’, a determinant of myeli- 
nation” and very recently as a tumour suppressor’. Our data establish 
another important biological function of SIRT2, namely a regulator of 
necroptosis. Given that the activities of the sirtuins are thought to be 
regulated by NAD levels, it is possible that the SIRT2-dependence of 
the RIP1-RIP3 interaction may provide a mechanism to regulate cel- 
lular necrosis based on overall cellular energetics. In this scenario, the 
balance between apoptosis and necrosis would be potentially regulated 
by tissue NAD levels”*. 

Finally, it may also be important to note that the initial description 
of the specific SIRT2 inhibitor AGK2 showed that this compound was 
effective in a Drosophila model of neurodegeneration’®. The mecha- 
nism for this protection was not defined in this initial study. Given the 
known role of necrosis in neurodegenerative conditions", it is temp- 
ting to speculate that our present observations may provide some 
insight into these previously observed benefits. These past observa- 
tions, coupled with our own in vivo data, suggest that transient inhibi- 
tion of SIRT2 may have widespread clinical utility for a range of 
clinically important conditions including ischaemic stroke, myocar- 
dial infarction, as well as presumably the growing list of conditions 
where necrosis is thought to have a significant role’’. 


METHODS SUMMARY 

Cells and mice. L929 cells (American Type Culture Collection) were cultured at 
37 °C in DMEM (Invitrogen) supplemented with 10% FBS, 100 units ml’ peni- 
cillin, and 100 pg ml~' streptomycin. MEFs were prepared in standard fashion 
and maintained in DMEM supplemented with 15% FBS. To generate SIRT2- 
deficient mice, we cloned the murine Sirt2 gene from a 129 mouse genomic 
DNA library. A 2.7 kb SacI-Sacl fragment containing exons 3 and 4 was cloned 
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into the pLNTK vector as a 5’ arm, and a 3.8-kb SacI-Sacl fragment was inserted 
into the opposite side of the pGK-Neo cassette as a 3’ arm. By homologous 
recombination, the resulting targeting construct deleted exons 5, 6 and a part 
of exon 7 of the Sirt2 gene. The targeting construct was electroporated into TC1 
embryonic stem cells, and correctly targeted clones were isolated via positive and 
negative selection followed by Southern blotting. The targeted clones were further 
transiently transfected with pMC Cre recombinase to remove the Neo gene and 
subsequently injected into C57BL6/J blastocysts to obtain chimaeras. The induc- 
tion of necrosis was in general as previously described’. In brief, L929 cells were 
pre-treated with 50 1M of zVAD-fmk for 1h before stimulation with 100 ng ml" 
of recombinant mouse TNF-« (R&D Systems). L929 cells were collected 2 h after 
stimulation for biochemical assays, including RIP1-RIP3 complex formation and 
acetylation detection, or 5h after stimulation to assess necrosis. Induction of 
necrosis was similar for MEF cells, except that these cells were pre-treated for 
1h with 20 WM zVAD-fmk and 1 pg ml cycloheximide followed by 50 ng ml? 
TNF-o stimulation. MEF cells were routinely collected 5h after TNF-« stimu- 
lation for biochemical studies and 18 h after stimulation for determining necrosis. 
Full methods are available as Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cells, immunoprecipitation and protein analysis. HeLa and 293T cells (both 
obtained from ATCC) were grown in DMEM (Invitrogen) supplemented with 
10% EBS, 100 units ml’ penicillin, and 100 ug ml streptomycin. RIP1-deficient 
Jurkat T cells (a gift of M. Lenardo) were cultured in RPMI media (ATCC) 
supplemented with 15% FBS (ATCC), 100 units ml’ penicillin and 100 pg ml? 
streptomycin. 

For immunoprecipitation analysis, L929 or MEF cell lysates (3 mg protein) 
were mixed with the indicated antibody (2 1g) at 4 °C overnight followed by the 
addition of 80 ul of protein G-Sepharose (Amersham Biosciences) for 1h at 4 °C. 
Immune complexes were washed five times with lysis buffer (Cell Lysis Buffer 
from Cell Signaling), supplemented with complete mini-protease inhibitor cock- 
tail (Roche Applied Science). After boiling in 5X sample buffer, samples were 
subjected to SDS/PAGE, transferred to nitrocellulose using the iBlot Dry Blotting 
System (Invitrogen), and then immunoblotted with the indicated primary anti- 
bodies, including anti-haemagglutinin (Roche Applied Science), anti-Flag (Sigma), 
SIRT2 (SantaCruz Biotechnology), RIP3 (ProSci), actin (Cell Signaling), RIP1 (BD 
Biosciences) or acetyl-lysine (Cell Signaling), followed by the appropriate horse- 
radish-peroxidase-conjugated secondary antibodies (Santa Cruz Biotechnology). 
Bands were visualized by enhanced chemiluminescence (Pierce). Where indicated, 
cells treated with 10 1M AGK2 (Tocris) for 16h before collection. 

For the initial detection of SIRT2-interacting proteins, HCT-116 cells were 
transfected with either 5 1g of an empty Flag vector or a Flag-SIRT2 plasmid using 
Effectene transfection reagent (Qiagen). After an overnight incubation, the cells 
were collected in 1 ml of lysis buffer (50 mM Tris pH 7.4, 1% Triton X-100, 0.5% 
NP-40, 150 mM NaCl, protease and phosphatase inhibitor cocktail (Sigma) and 
10% glycerol) to yield 15-20 mg of protein lysates. These lysates were then incu- 
bated with anti-Flag agarose beads (Sigma) overnight at 4 °C. Immunoprecipitates 
were then washed 3 times in lysis buffer and twice more in elution buffer (50 mM 
Tris pH7.4, 150mM NaCl). Elution was performed at 4°C in 150 ul of elution 
buffer, including 3 ll of a5 mg ml * solution of Flag peptide (Sigma). The super- 
natant was then collected and subjected to in-solution protein digestion and pro- 
cessing by mass spectrometry. 

To determine acetylation sites, the RIP1 amino acid sequence was analysed 
using a widely available bioinformatics tool called PAIL (prediction of acetylation 
on internal lysines). We used high stringency settings to determine that lysine 530 
had a score of 1.17, demonstrating that the residue was likely acetylated (89% 
confidence). Further details regarding the algorithm can be found from previous 
studies” and at http://bdmpail.biocuckoo.org/. 

To directly measure SIRT2 activity we used the SIRT-Glo Assay (Promega). 
Briefly, mouse L929 cells were transfected with a plasmid encoding for Flag-SIRT2 
or an empty vector. Transfected cells were then either left untreated or stimulated 
to undergo necrosis with TNF-« (100ng ml! for L929 cells and 20 ng ml! for 
Jurkat T cells) and z-VAD-fmk (40 1M). Cells were collected and immunopreci- 
pitated using the Sigma Flag M2 antibody. The SIRT2 protein was eluted using Flag 
Peptide and was subsequently used as a source of purified enzyme in the protocol 
as specified by the manufacturer. 

NAD levels were measured in untreated L929 cells and L929 cells treated with 
TNF/zVAD combination using the EnzyChrom NAD/NADH Assay Kit (BioAssay 
Systems). The methods followed the manufacturer’s protocol. The values obtained 
were normalized for protein concentration. 

GST-tagged fusion proteins were expressed and purified as described previ- 

ously”®. The purity of the fusion proteins was tested by sodium dodecyl sulphate 
(SDS)-polyacrylamide gel electrophoresis (PAGE) and Coomassie staining. For 
protein interaction studies, in vitro translations were performed with rabbit retic- 
ulocyte lysate and the Promega TNT system, and radiolabelling was done with 
[*’S]cysteine (Perkin Elmer). Equal amounts of radiolabelled in-vitro-translated 
SIRT2 proteins was added to GST fusion proteins bound to glutathione resin and 
incubated for 1 h at 4 °C. After extensive washing with phosphate-buffered saline 
(PBS) containing 0.25% NP-40, the bound proteins were analysed by SDS-PAGE 
and autoradiography. 
Acetylation and deacetylation measurements. For assessment of RIP1 acetyla- 
tion, protein lysates from HeLa cells (4 mg), L929 cells (4 mg) or heart tissue (3.5 mg) 
were mixed with 20 ul of acetyl-lysine or RIP1 antibodies overnight at 4 °C followed 
by the addition of 60 pil of protein G-Sepharose (Amersham Biosciences) for 2 h at 
4°C. Immunoprecipitates were washed five times with lysis buffer as previously 
described. After boiling in 2X sample buffer, samples were subjected to SDS/ 
PAGE. After transfer to nitrocellulose, membranes were immunoblotted with the 
indicated primary antibodies followed by the appropriate horseradish peroxidase- 
conjugated secondary antibodies (Santa Cruz Biotechnology or Pierce). Bands were 
visualized by enhanced chemiluminescence (Pierce) as previously described”. 

For analysis of in vivo RIP1 acetylation, heart tissues with or without ischae- 
mia/reperfusion injury were collected from wild-type untreated mice, wild-type 
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AGK2-treated mice or Sirt2-'~ mice. The tissues were incubated in lysis buffer 
for 30 min at 4 °C followed by sonication (Sonicator XL 2020 Misonix, Inc.) for 
15-30 s at 20% amplitude. Lysates were subsequently clarified by centrifugation at 
16,000g (30 min at 4 °C) before further processing as described above. RIP acet- 
ylation was detected directly by immunoprecipitation with RIP1 antibodies (BD) 
and subsequent immunoblotting for acetyl-lysine antibody (Cell Signaling Tech- 
nology). Similarly, we detected RIP1-RIP3 complex formation in myocardial 
tissues under basal conditions or following ischaemia/reperfusion. Lysate was 
prepared as above and samples were first immunoprecipitated with a RIP1 anti- 
body followed by western blotting with a RIP3 antibody (Millipore). For endo- 
genous RIP3 acetylation, we analysed L929 cells with or without shRNA- 
mediated knockdown of SIRT2. Cell lysates were immunoprecipitated with an 
acetyl-lysine antibody and subsequently analysed by western blotting using a 
RIP3 antibody as described above. 

To assess the effect of p300 expression on RIP1 acetylation, we co-transfected 
HeLa cells with Flag-tagged RIP1 with or without an expression vector encoding 
the p300 acetyltransferase as previously described”*. Cells were collected 24 h after 
transfection and RIP1 acetylation assessed as described above. 

To assess in vitro deacetylation of full-length RIP1, we first prepared acetylated 
substrate by transfecting 293T cells with an expression construct encoding HA- 
RIP1. Twenty-four hours after transfection, cells were collected in lysis buffer and 
7 mg of transfected protein lysate was incubated with 4 1g of HA antibody and 
subsequently bound to Protein G Sepharose beads. To increase levels of RIP1 
acetylation, 10 4M AGK2 was added for the last 12h before cell collection. 
Purified SIRT2 protein was obtained by transfecting a Flag-tagged wild-type or 
deacetylase-inactive SIRT2 expression construct into 293T cells. The epitope 
product was purified by using an anti-Flag M2 affinity gel (Sigma) and subse- 
quently eluted competitively with five one-column volumes of a solution contain- 
ing 100 pg ml * Flag peptide (Sigma) in elution buffer. 

The deacetylase reaction was performed in the presence or absence of ~300 ng 
of purified wild-type or mutant SIRT2 along with NAD (10 mM). Equal amounts 
of acetylated substrate was prepared and the deacetylation reaction was per- 
formed for 2h at 37°C in deacetylation assay buffer (50mM Tris*HCl, 1 mM 
MgCl and 150mM NaCl (pH 8.0)) using a total reaction volume of 50 pl. The 
deacetylation of each protein substrate was subsequently detected after SDS/ 
PAGE, transfer to nitrocellulose, and immunodetection with an anti-acetyl-lysine 
antibody. 

For measurement of deacetylation of a peptide substrate, a RIP1 peptide with 
the sequence IKYTIYNSTGIQIGAYNY was synthesized in the unacetylated and 
K530 acetylated form by Elim Biopharmaceuticals Inc. In vitro deacetylase assays 
were conducted with 15 1M acetylated/unacetylated RIP1 peptide mixture, 2 1g 
of human recombinant SIRT2 (Cayman Chemicals), 10 mM NAD, in the deace- 
tylation assay buffer at 37 °C for 2h. 

In vitro acetylation assays were performed with 150 1M RIP1 peptide, 12 pg of 
the HAT domain of the acetyltransferase p300 (Enzo Life Sciences), 1 mM acetyl 
CoA, in a buffer containing 150 mM HEPES, 10% glycerol, 3mM DTT, 30 mM 
sodium butyrate at 30 °C for one hour. 

LCMS analysis was carried out using an EksigentnanoLC Ultra system con- 
nected to an LTQ OrbitrapVelos mass spectrometer (Thermo Scientific). Peptide 
sample was first loaded on to a trapping cartridge (Agilent) at a flow rate of 
6 ll min~'. Separation of the trapped peptides was performed on a reversed- 
phase PicoFrit column (New Objective) using a linear gradient of acetonitrile 
(5-45%) in 0.1% formic acid at a flow rate of 250 nl min” '. Eluted peptides were 
sprayed into the mass spectrometer equipped with a nano-spray ion source. 
FACS analysis and necrosis assays. After stimulation with TNF-« (5h for L929 
cells, 14h for Jurkat T cells and 18 h for MEFs) cells were washed twice with cold 
PBS and then resuspended in 1X binding buffer (BD Biosciences) at a concen- 
tration of ~1 X 10° cells ml‘. Then, 100 pil of the solution (~1 X 10° cells) was 
transferred to a 5 ml culture tube and stained with 5 pl each of allophycocyanin- 
annexin V (BD Biosciences) and 50 pg ml! propidium iodide (Invitrogen). The 
cells were gently mixed and incubated at room temperature for 15 min. For 
assessment of necrosis, 400 pl of 1X binding buffer was then added to each tube 
and the samples were analysed by flow cytometry using a LSRII instrument 
(Becton-Dickinson). Where indicated tubacin and necrostatin-1 (Enzo Life 
Sciences) were used at concentrations of 2 1M for 4h and 20 iM for 1h, respect- 
ively, before the addition of the necrotic stimuli. 

Transfections, plasmids and lentiviral transductions. L929 cells were routinely 
transfected with GenJet (Signagen) as per the manufacturer’s protocol. The Flag- 
SIRT2 plasmid was obtained from Addgene, and the Q130A deacetylase-inactive 
mutant was made using the following primers: Forward primer 5'-CTACTCCT 
GCGCTGCTACACGGCAAACATAGAT; Reverse primer 5’-CGTGGTAGC 
AGCGCAGGAGTAGCCCCTTGTC. Glutathione-S-transferase (GST) fusion 
plasmids were all cloned in pGEX-6T. The RIP3 truncations below were cloned 
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into pGEX-6T with the following primers: Full length RIP3, forward primer 
5'-TCCCCCGGGTATGTCGTGCGTCAAGTTATGGCCC and reverse primer 
5'-CCGCTCGAGTTTCCCGCTATGATTATACCAACC; RIP3 with residues 
spanning positions 1 to 287, forward primer 5’-TCCCCCGGGTATGTCGTG 
CGTCAAGTTATGGCCC and reverse primer 5’CCGCTCGAGCTCACTGCT 
CCAGCAGAGCTGCATT; Residues spanning position 287 to 440 or 518, for- 
ward primer 5’-TCCCCCGGGTGAGCCCAAGGACAGACCCTCCTTC, and re- 
verse primers for 440, 5'-CCGCTCGAGTCGGGGTCCAGGACTTGGTGTTCC, 
and 518, 5'-CCGCTCGAGAGGGCCTTCTTGCGAACCTACTGG. The haemag- 
glutinin-tagged RIP1 and the K530 mutants were cloned into pcDNA3.1+ vector 
using EcoRV and Xhol sites and the following primers: K530Q, forward primer 5'- 
CCAACAGATGAATCTATACAGTATACCATATACAATAGT and reverse pri- 
mer 5'’-ACTATTGTATATGGTATACTGTATAGATTCATCTGTTGG. K530A, 
forward primer 5’-CCAACAGATGAATCTATAGCATATACCATATACAATA 
GT and reverse primer 5’-ACTATTGTATATGGTATATGCTATAGATTCA 
TCTGTTGG. For the construction of the 3X Flag-RIP1 we inserted the Rip1 gene 
into the pCMV-3 Flag vector by cutting both with EcoRI and KpnI. Rip1 gene 
products were prepared by PCR with the following primers: Forward primer: 
5'-CCGGAATTCCATGCAACCAGACATGTCCTT Reverse primer: 5'-CGGG 
GTACCCTAGTTCTGGCTGACGTAAAT The HA-tagged wild-type SIRT1 has 
been described”. The Flag-tagged RIP3 plasmid was a gift from X. Wang. 
Lentiviral vectors for stable shRNA-mediated knockdown of SIRT2 and RIP3 were 
purchased from Open Biosystems. We in general screened five shRNA for each 
gene to be knockdown and confirmed the biochemical and biological endpoints 
using at least two independent shRNAs. The sequence and clone ID used in this 
study included: SIRT2: clone ID TRCN0000012121, mature antisense TTCCAGC 
GTGTCTATGTTCTG. SIRT2: clone ID TRCN0000012119, mature antisense 
TTCTCCAGGTTTGCATAGAGG. RIP3: clone ID TRCN0000022536, mature 
antisense AAAGCTGTAGACATCACTCGC. RIP3: clone ID TRCN0000022538, 
mature antisense AAGGAGTTGTAGTTCCCAATC. 

Amplification was via standard methods in subconfluent 293T cells. Infection 
of L929 cells was performed in the presence of polybrene (Sigma) at a final 
concentration of 8g ml’. Cells were incubated with virus mixture for 24h 
and medium was changed to fresh growth medium with puromycin (Sigma) 
for stable knockdown. 

For constructing stable Jurkat T cells lines, we used a previously described 
mutant line of Jurkat cells that had undergone random chemical mutagenesis 
and was selected on the basis of the absence of detectable RIP1 expression’*"”. 
These cells fail to undergo TNF-«-stimulated necrosis without the re-introduc- 
tion of RIP1'’. RIP1-deficient Jurkat cells were electroporated with the indicated 
RIP1 wild-type or mutant construct, or with an empty vector control plasmid. 
Cells were selected for two weeks using 400 1g ml of geneticin (Invitrogen) 
before analysis. Pooled colonies expressing wild-type RIP1 or the various RIP1 
mutants were analysed for their degree of programmed necrosis stimulated by 


addition of TNF-« and z-VAD-fmk. Western blot analysis demonstrated that 
overall RIP1 expression was similar in all stable pooled clones. 
Langendorff perfusion and infarct model. Hearts from young wild-type or 
Sirt2-'~ littermates were isolated and Langendorff-perfused with Krebs- 
Heinseleit (KH) buffer for 25 min (equilibration) before being subjected to 
20 min of no-flow ischaemia. Functional recovery of left ventricular developed 
pressure (LVDP) and rate pressure product (RPP; product of LVDP and heart 
rate) was measured following 2h of reflow and expressed as a percentage of the 
initial RPP before ischaemia as previously described*’. For the AGK2 inhibitor 
experiments, mouse hearts were perfused with 1% BSA in KH buffer containing 
either the vehicle control (0.1% DMSO) or 154M of AGK2 (a specific SIRT2 
inhibitor) for 10 min before 20 min of no-flow ischaemia. This was then followed 
by 120 min of reperfusion with normal KH buffer. At the end of the 2-h reflow, 
hearts were incubated in KH buffer containing 1% (w/v) 2,3,5-triphenyltetrazo- 
lium chloride (TTC) to measure the extent of necrosis*!. Samples were sliced into 
cross-sections and area of infarct as a percent of the whole heart was quantified. 
For in vivo infarction, we used the previously described method of mouse 
coronary artery ligation that does not require ventilation support*’. In brief, 
the left coronary artery of the mouse was ligated for 30 min with a slipknot suture 
placed approximately 2 mm from the artery’s origin. At the end of the ischaemic 
period, the slipknot suture is removed and the hearts were reperfused in vivo for 
an additional 4h. At the end of the reperfusion procedure the animals were 
euthanized. Quantification was performed in a blinded fashion using the area 
at risk (determined by perfusing with Evans blue dye) and the area of infarction 
(TTC staining). These experiments were performed with female mice that were 
age-matched littermates. All animal experiments were performed in accordance 
and with approval from the NHLBI Animal Care and Use Committee. 
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Bypass of a protein barrier by a 
replicative DNA helicase 


Hasan Yardimci!, Xindan Wang’, Anna B. Loveland, Inger Tappin’, David Z. Rudner’, Jerard Hurwitz’, Antoine M. van Oijen** 


& Johannes C. Walter!* 


Replicative DNA helicases generally unwind DNA as a single hexamer that encircles and translocates along one strand of 
the duplex while excluding the complementary strand (known as steric exclusion). By contrast, large T antigen, the 
replicative DNA helicase of the simian virus 40 (SV40), is reported to function as a pair of stacked hexamers that pumps 
double-stranded DNA through its central channel while laterally extruding single-stranded DNA. Here we use 
single-molecule and ensemble assays to show that large T antigen assembled on the SV40 origin unwinds DNA 
efficiently as a single hexamer that translocates on single-stranded DNA in the 3’-to-5’ direction. Unexpectedly, 
large T antigen unwinds DNA past a DNA-protein crosslink on the translocation strand, suggesting that the large T 
antigen ring can open to bypass bulky adducts. Together, our data underscore the profound conservation among 
replicative helicase mechanisms, and reveal a new level of plasticity in the interactions of replicative helicases with 


DNA damage. 


An essential step in DNA replication is the unwinding of the double 
helix by a DNA helicase’”. In general, these enzymes form a hexame- 
ric ring that encircles and translocates along one DNA strand in the 
3'-to-5’ or 5'-to-3' direction while excluding the other strand. This 
steric exclusion mechanism of DNA unwinding has been demon- 
strated for gp4 protein in phage T7 (ref. 3), DnaB in bacteria*, and 
MCM2-7 in eukaryotes”®. 

The mammalian DNA tumour virus SV40 encodes a multifunc- 
tional protein, large T antigen, which functions as a replicative heli- 
case that cooperates with host cell replication factors to copy the SV40 
genome. Although cell-free SV40 replication has served as a model for 
mammalian DNA replication for several decades’, fundamental 
aspects of how large T antigen unwinds DNA remain controversial’. 
In particular, it is unclear whether large T antigen functions as an 
obligate double hexamer (Fig. 1A, a and b) or as a single hexamer 
(Fig. LA, c and d). In support of the double-hexamer model, large T 
antigen caught in the act of unwinding displayed ‘rabbit ear’ struc- 
tures with two single-stranded DNA (ssDNA) loops emanating from 
the double hexamer’ (Fig. 1A, a and b). In addition, mutations in large 
T antigen that disrupt double-hexamer formation inhibit origin- 
dependent DNA unwinding'*”’. Finally, purified large T antigen dou- 
ble hexamers exhibit higher unwinding activity than single hexamers 
on forked DNA substrates'*’’. Another long-standing question is 
whether large T antigen unwinds DNA by means of steric exclusion 
(Fig. 1A, a and c) or a fundamentally different mechanism in which 
the translocating helicase encircles duplex DNA while extruding 
ssDNA from side channels (Fig. 1A, b and d). Large T antigen can 
translocate on ssDNA in the 3’-to-5’ direction when presented with a 
ssDNA template'*’’, and the closely related E1 helicase encircles 
ssDNA in its central channel’’, supporting the steric exclusion model. 
Consistent with the double-stranded DNA (dsDNA) translocation 
model, structural studies showed that the central channel of large T 
antigen can expand to accommodate dsDNA and that large T antigen 
has side channels large enough for extrusion of ssDNA”. 


Furthermore, several studies suggest that large T antigen assembled 
on the SV40 origin encircles dsDNA” ”. These results raise the pos- 
sibility that large T antigen activated at an origin uses a dsDNA 
translocation mode. Inspired in part by large T antigen, both the 
double-hexamer and dsDNA translocation models were proposed 
for MCM2-7 (refs 23, 24), but recent evidence indicates that the 
native MCM2-7 complex unwinds DNA by steric exclusion and 
can function as a single hexamer*”’. 


Uncoupling of large T antigen double hexamers 


The double-hexamer model suggests that DNA is pumped towards 
the interface between two large T antigen hexamers (Fig. 1A, a and b). 
Therefore, an extended DNA molecule that is attached to a surface at 
both ends should not be replicated efficiently owing to tension that 
accumulates at the ends of the DNA (Supplementary Fig. 1c). By 
contrast, if large T antigen can function as a single hexamer, a 
stretched DNA molecule should undergo extensive large-T-antigen- 
dependent replication (Supplementary Fig. 1d). To distinguish 
between the single- and double-hexamer models, we replicated i 
DNA containing the SV40 origin (Aori) that was attached at one or 
both ends to the surface of a microfluidic flow cell (Fig. 1B). We first 
examined replication of singly tethered Aori DNA. After immobiliz- 
ing DNA on the surface”, large T antigen was drawn into the flow cell, 
allowing it to assemble on the origin (Fig. 1B). HeLa cell extract 
containing replication proteins”’** was then introduced to initiate 
replication, followed, after 20 min, by a second extract containing 
digoxigenin-modified dUTP (dig-dUTP) to pulse-label newly repli- 
cated DNA. After further incubation, dsDNA was stained with 
SYTOX Orange (SYTOX), and dig-dUTP-containing regions were 
labelled with fluorescein-conjugated anti-digoxigenin antibody (anti- 
dig). On 10-20% of the molecules, the SYTOX signal showed a high 
intensity tract (Fig. 1C, left, and Supplementary Fig. 2a) corresponding 
to a replication bubble. Furthermore, two anti-dig tracts were present 
at the ends of each replication bubble, consistent with bidirectionally 
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Figure 1 | Large T antigen is not an obligate double hexamer during 
replication. A, Models for DNA unwinding by large T antigen (T-ag). See 
main text for details. B, Experimental procedure for replication of singly and 
doubly tethered Aori DNA. C, SYTOX and anti-dig images of singly tethered 
(left) and doubly tethered (right) Aori DNAs that underwent replication in 
HeLa cell extracts. Dig-dUTP-incorporated regions occasionally exhibited 
higher SYTOX intensity owing to nonspecific staining of an anti-dig antibody 
with SYTOX. Extent of slack and replication on the doubly tethered DNA are 
indicated. Yellow arrowheads, estimated position of the origin. D, Length of 
anti-dig tracts on singly tethered (grey) and doubly tethered (black) DNA 
molecules after a 40-min dig-dUTP pulse. To measure the fork rate, the tract 
length distribution was fit to a Gaussian and the resulting average tract length 
was divided by the duration of the dig-dUTP pulse (40 min). 


growing bubbles (Fig. 1C, left, and Supplementary Fig. 2a). The gap 
between two anti-dig tracts always coincided with the expected loca- 
tion of the origin (Fig. 1C and Supplementary Fig. 2, yellow arrow- 
heads), and replication bubbles were not observed on } DNA lacking 
the SV40 origin (data not shown). The average fork rate measured 
from the anti-dig tract length was 188 + 61 base pairs (bp) min | 
(mean + s.d., = 174, Fig. 1D) in agreement with previous reports”. 
Therefore, this single-molecule approach is well suited to study large- 
T-antigen-dependent replication. 

We repeated the experiment on Aori DNA that was stretched to 85- 
90% of its B-form contour length and tethered at both 3’ ends. Iflarge 
T antigen functions as an obligate double hexamer, a doubly tethered 
DNA will not be replicated to a greater extent than the slack (10-15%) 
that is initially present in the molecule. In contrast to this prediction, 
we found that doubly tethered Aori molecules contained replication 
bubbles comprising much more than 10-15% of the molecule 
(Fig. 1C, right, and Supplementary Fig. 2b, c), indicating that indi- 
vidual large T antigen hexamers function efficiently, similar to 
MCM2-7 (ref. 25). Importantly, the average fork rate on doubly 
tethered DNA (195 + 75 bp min |, n= 78, Fig. 1D) was essentially 
the same as on singly tethered DNA, suggesting that any uncoupling 
of double hexamers does not impair fork progression. Finally, as 
expected from independently functioning replisomes, progression 
of sister forks showed no correlation on singly tethered (R = 0.17, 
P=0.11, n= 87, Supplementary Fig. 2d) and doubly tethered 
(R= —0.15, P = 0.35, n = 39, Supplementary Fig. 2d) Aori molecules. 

To visualize the spatial separation of large T antigen double hexamers 
in real time, we examined large-T-antigen-mediated DNA unwinding on 
doubly tethered Aori (Fig. 2a) using the ssDNA-binding replication pro- 
tein A (RPA) fused to the green fluorescent protein (GFP)-like protein 
mKikGR (RPA™“*S®), We detected bidirectionally growing linear tracts 
of RPA™<*CR (Fig. 2b), demonstrating that large T antigen double 
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Figure 2 | Real-time visualization of sister fork uncoupling during 
unwinding of doubly tethered DNA. a, Large T antigen was drawn into a 
flow cell containing doubly tethered Aori. After 45 min, RPA™<*CR was 
introduced and mKikGR was imaged for 60 min. b, Kymograph of mKikGR 
fluorescence. Minutes denote time after introduction of RPA™<*CR, 


hexamers can uncouple after initiation. The average unwinding rate 
(215 + 24bp min }+sd,n= 16), determined through the growth rate 
of RPA™SKGR tracts, was consistent with the unwinding rate of purified 
large T antigen” and our analysis of anti-dig tracts (Fig. 1D). The fact 
that DNA unwinding by large T antigen alone occurs at a similar rate as 
large-T-antigen-dependent replication indicates that large T antigen does 
not require coupling with a DNA polymerase to unwind DNA at optimal 
rates, unlike some prokaryotic helicases**?. 


Large T antigen unwinds DNA by steric exclusion 


We next examined whether large T antigen that initiates unwinding 
from the SV40 origin ultimately translocates on ssDNA or dsDNA. 
To this end, large T antigen was collided with biotin-streptavidin 
adducts, which large T antigen is known to disrupt'’. We modified 
a 518-bp-long linear duplex DNA containing the SV40 origin with a 
site-specific biotin on the top or bottom strands (Fig. 3A, a). If large T 
antigen translocates on ssDNA in the 3’-to-5’ direction, the hexamer 
moving to the right from the origin should remove streptavidin from 
the bottom strand (SA°*°™) but not the top strand (SA‘°?) (Fig. 3A, 
b). By contrast, the dsDNA translocation model predicts that both 
SAP” and SA‘? will be dislodged by large T antigen (Fig. 3A, c). 
We found that large T antigen displaced SA°*°™ (Fig. 3B, lane 8, 
red arrowhead, and Supplementary Fig. 3) but not SA‘ (Fig. 3B, 
lane 6, green arrowhead, and Supplementary Fig. 3), indicating that 
the bottom strand passes through the central chamber of large T 
antigen while the top strand is excluded (see Fig. 3C for quantifica- 
tion). We conclude that large T antigen that assembles on the 
origin DNA translocates on the leading strand template in the 3’- 
to-5’ direction. 

To examine the translocation mechanism of large T antigen during 
DNA replication, we attached a quantum dot (Qdot) by a digoxigenin- 
anti-dig interaction to either the leading (Qdot'**) or lagging (Qdot8) 
strand templates to the left of the origin on Aori (Fig. 3D, cartoons). As in 
the experiment above, the 3’-to-5’ ssDNA translocation model predicts 
the specific removal of Qdot!**4, whereas the dsDNA translocation model 
predicts the removal of both Qdot!**4 and Qdot!"8, Because most Qdots 
dissociated from Aori independently of replication (Supplementary Fig. 4 
and data not shown), we could not compare replicated molecules that 
retained and lost Qdots, respectively, to evaluate Qdot removal by the 
replisome. Given that large T antigen can displace streptavidin (Fig. 3B), 
most events in which the edge of an anti-dig tract coincided with a Qdot 
(Fig. 3D and Supplementary Fig. 4f) are likely to represent spontaneous 
fork stalling independently of the Qdot (see Supplementary Fig. 2 for 
examples of spontaneous fork stalling). Because the likelihood of such an 
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Figure 3 | Large T antigen translocates on ssDNA in the 3’-to-5’ direction. 
A, a, Cartoon of 518-bp-long 5’-labelled (red stars) DNA templates used for 
streptavidin displacement assays. A, b, c, Predictions of 3'-to-5’ ssDNA 
translocation (b) and dsDNA translocation (c) models. B, DNAs biotinylated on 
the top or bottom strands (as in A, a) undergo complete mobility shift after 
streptavidin (SA) addition, indicating that all DNA molecules are modified with 
biotin (lanes 2, 4). DNA was pre-incubated with buffer (lanes 5, 7) or streptavidin 
(lanes 6, 8), and unwinding was initiated with large T antigen and RPA (lanes 
5-8). Excess biotin saturated any displaced streptavidin. To assess the migration 
of ssDNA with or without streptavidin, DNA was heat denatured, rapidly cooled 
down and mixed with buffer (lanes 9, 11) or streptavidin (lanes 10, 12). Because 
both strands are radiolabelled, streptavidin association with one strand shifts 
only half of the signal (lanes 10, 12). ds, dsDNA; ss, ssDNA. C, Quantification of 


event is the same for Qdot'**4 and Qdot'’, we determined the ratio of 
molecules showing fork bypass through a Qdot (Supplementary Fig. 4e) 
to those exhibiting stalling at the Qdot (Supplementary Fig. 4f) to obtain 
a relative probability of Qdot removal by the replisome. Importantly, 
81% of the molecules exhibited fork bypass through Qdot"* (Fig. 3D, a) 
whereas only 21% bypass was observed through Qdot'**“ (Fig. 3D, b), 
suggesting that Qdot'* is more prone to displacement by the replisome. 
This result suggests that large T antigen also functions as a 3’-to-5’ 
ssDNA translocase during DNA replication. 


Large T antigen can bypass a bulky protein adduct 


Given that large T antigen encircles and translocates on the leading 
strand template, our observation that a considerable number of 
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ssDNA with (grey) and without (black) streptavidin from lanes 6 and 8 of 

B. Error bars indicate s.d. for three independent experiments. Some spontaneous 
dissociation of streptavidin occurred in the presence of free biotin (B, lanes 6 and 
8, ds). The extent of large-T-antigen-independent streptavidin dissociation was 
determined using the relative amounts of dsDNA that lost (ds) and retained SA 
(ds+SA). This fraction was then used to measure the amount of ssDNA that lost 
and retained streptavidin, respectively, if no spontaneous streptavidin 
dissociation had occurred. D, a, b, SYTOX, anti-dig and Qdot images of 
representative molecules after fork collision with Qdot'? (a) or Qdot'* (b) in 
HeLa cell extracts (performed as in Fig. 1B). Because dig-dUTP was continuously 
present during the replication reaction, replication bubbles were fully labelled 
with anti-dig. The percentage of molecules exhibiting fork bypass and stalling 
events is indicated (see also Supplementary Fig. 4). 


anti-dig tracts bypassed Qdot'*** 


was unexpected (Fig. 3D, b, left 
panel). Bypass through Qdot'*** indicates that large T antigen or other 
helicases in the extract can circumvent a bulky adduct on the leading 
strand template. To test this idea further, we covalently conjugated 
Hpall methyltransferase (M.Hpall) to the lagging (MH) or leading 
(MH'*) strand templates of a 518-bp-long linear DNA substrate 
containing the SV40 origin (Supplementary Fig. 5a). Notably, large 
T antigen unwound DNA containing MH" and MH" equally well 
(Fig. 4A). MH'*** was not removed from DNA by large T antigen 
during unwinding, as seen by the presence of ssDNA conjugated to 
M.Hpall (Fig. 4A, b). Uneven labelling of the two strands resulted in 
the intensity difference of ssDNA with and without M.Hpall 
(Supplementary Fig. 5b). 
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Figure 4 | Large T antigen can bypass a covalent protein barrier on the 
translocation strand. A, a, b, Large-T-antigen-dependent unwinding of DNA 
containing MH (a) or MH<*¢ (b). The species corresponding to each band 
are depicted. Heat denaturation caused electrophoretic smearing of M.Hpall- 
conjugated DNA and therefore was not used for assessment of ssDNA 
migration (data not shown). A, c, Quantification of unwinding in a and 

b. B, Left, unwinding of unmodified (lane 3), MH8-modified (lane 6), and 
MH'"**_ modified (lane 9) DNA by pre-assembled large T antigen (see text for 
details). The first two lanes in all samples correspond to 10% of input DNA 
without bead conjugation in the absence (lanes 1, 4, 7) and presence (lanes 2, 5, 
8) of large-T-antigen-mediated unwinding. Right, quantification of unwound 
DNA by pre-assembled large T antigen (lanes 3, 6, 9). Amount of unwinding 
was normalized to that of unmodified DNA. a.u., arbitrary units. Error bars 
indicate s.d. for three independent experiments. 


We next addressed whether the same large T antigen that initially 
collides with MH'**4 bypasses the protein adduct, or whether a new 
large T antigen hexamer from solution assembles downstream of the 
protein adduct. To this end, DNA was immobilized on magnetic beads, 
incubated with large T antigen to assemble the helicase at the origin, 
washed to remove free large T antigen, and finally mixed with RPA to 
initiate unwinding. We confirmed that during this sequence, unwind- 
ing initiated from the origin and that most unwinding was carried out 
by pre-assembled large T antigen (Supplementary Fig. 6). Importantly, 
large T antigen pre-assembled on the origin was sufficient to unwind 
the template containing MuHI<4 (Fig. 4B, left, lane 9), and unwinding 
efficiency was only moderately reduced relative to MH" (Fig. 4B, 
right). Thus, the vast majority of large T antigen molecules are able 
to bypass a bulky adduct on the translocation strand. 


Large T antigen collision with two protein barriers 


Our results suggest that after collision with a protein adduct, the large 
T antigen ring either opens to allow bypass (Supplementary Fig. 7a, ii) 
or denatures the protein adduct and threads the unfolded polypeptide 
chain through the helicase central channel (Supplementary Fig. 7a, 
iii). Because M.Hpall interacts with DNA by an internal amino acid’, 
the latter model would require large T antigen to accommodate two 
polypeptide chains and ssDNA in its central channel (Supplementary 
Fig. 7a, iii), which seems unlikely. To rule out the denaturation model, 
we repeated the experiment with a DNA template containing two 
protein adducts spaced 60 nucleotides (~200 A) apart along the 
duplex (Supplementary Fig. 7b, i). In this way, unwinding the DNA 
by M.Hpall denaturation would require four polypeptide chains and 
the translocation strand to be threaded through the central channel of 
large T antigen (Supplementary Fig. 7b, ii), which is incompatible 
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with known large T antigen structures'*’’. Notably, large T antigen 
unwound the template with two adducts as efficiently as the unad- 
ducted template (Fig. 5A, compare lanes 9 and 12, and 5B, b, for 
quantification), suggesting that bypass must occur by transient open- 
ing of the large T antigen ring. We found that two M.HpallI adducts 
spaced 14 nucleotides apart severely inhibited unwinding by large T 
antigen (Fig. 5A, lane 18, and 5B, c). Because the longitudinal axis of 
large T antigen (~120 A) is now greater than the inter-adduct dis- 
tance (~50 A), this result indicates that to perform effective bypass of 
the second obstruction, the large T antigen ring must re-close after the 
first adduct has traversed the entire channel (Supplementary Fig. 7b, 
iii and iv). Together, the data suggest that the large T antigen ring 
transiently opens to bypass protein-DNA crosslinks. 


Discussion 

Here we present single-molecule and ensemble experiments that 
determine the molecular mechanism by which large T antigen 
unwinds DNA at the replication fork. In contrast with the prominent 
double-hexamer model”’, we demonstrate that the large T antigen 
double hexamers that form at the origin of replication can physically 
separate and function independently when the DNA template is 
placed under tension. Together with previous results’®"’, our data 
suggest that interactions between hexamers are required for initiation 
but dispensable during elongation. The coupling between two hex- 
amers that led to the observation of rabbit ear structures’ may be due 
to direct interactions between hexamers or facilitated by other fac- 
tors**. Whether such double hexamers are normally retained during 
replication elongation in the complex environment of the cell is dif- 
ficult to assess. Importantly, the ability of hexamers to function inde- 
pendently is advantageous because it allows replication to go to 
completion even if one replisome stalls or collapses. Notably, we 
found that large T antigen that initiates unwinding from the origin 
ultimately surrounds ssDNA and moves in the 3’-to-5’ direction, as 
proposed for the related El helicase’’. Taken together, our results 
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Figure 5 | Bypass of tandem protein adducts by large T antigen depends on 
the inter-adduct distance. A, Unwinding of DNA containing, none, one or two 
M.Hpall adducts on the translocation strand by large T antigen. The three 
templates (colour-coded differentially) had slightly different lengths because each 
template contained 240 bp before the first protein adduct and 277 bp after the last 
protein adduct but different amounts of DNA between the adducts. DNAs were 
internally labelled with [oP] -dATP. B, bottom strand; BM, bottom strand 
modified with M.Hpall; nt, nucleotide; T, top strand. B, Quantification of 
unwinding from three independent experiments as in A. Error bars indicate s.d. 
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emphasize that replicative helicases in different organisms use a com- 
mon mechanism to unwind DNA. 

Our results also show that large T antigen is able to bypass a protein 
crosslinked to the translocation strand, and they favour a model in 
which the bypass occurs by opening of the hexamer. This unprece- 
dented flexibility of large T antigen may allow the SV40 replisome to 
bypass natural impediments including tightly bound proteins on 
DNA such as transcription complexes or covalently trapped pro- 
teins**”*. If large T antigen initially assembles around dsDNA at the 
origin’®’”’, the ability of the large T antigen ring to open on DNA may 
also facilitate the exclusion of one strand from the central channel 
during replication initiation as proposed”. In the future, it will be 
important to confirm by more direct means that the large T antigen 
ring opens while bypassing a bulky adduct. Furthermore, determining 
whether other replicative helicases exploit similar plasticity to bypass 
replication barriers can guide our understanding of how these 
enzymes interact with DNA damage. 


METHODS SUMMARY 


For single-molecule experiments, Aori DNA was annealed to 3’-biotin-modified 
oligonucleotides at its ss DNA end and attached to the streptavidin-functionalized 
surface of a microfluidic flow cell. Large T antigen and HeLa cell extracts were 
diluted in large T antigen binding (TAB) buffer containing 30 mM Hepes, pH 7.5, 
7mM MgCl, 0.5 mM dithiothreitol and 50 pg ml — ' BSA, and introduced into the 
flow cell. Total internal reflection fluorescence imaging was performed on a 
custom-built setup. Ensemble DNA unwinding assays were performed on radio- 
labelled PCR fragments in TAB buffer supplemented with large T antigen and 
RPA. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Protein expression and purification. SV40 large large T antigen** and RPA*®” 
were expressed and purified as described. Fluorescently tagged RPA was made by 
cloning mKikGR and a histidine-tag onto the amino terminus of the RPA70 
subunit. RPA™<*°8 was purified by Ni-NTA chromatography. 

Preparation of ori. To construct the A DNA with the SV40 origin (Aori), we first 
carried out a series of cloning steps to generate a plasmid that contains a 4.7-kb 
DNA fragment including the 233 bp SV40 origin region from pUC.HSO” at one 
end, a 22-bp sequence (5'-CCTCAGCATAGATGTCCTCAGC-3’) for Qdot 
attachment at the other end, and a 4.5-kb DNA spacer in between, which contains 
a tetracycline-resistance gene, an ampicillin-resistance gene and part of a Bacillus 
subtilis non-essential gene, yrvN. This fragment was PCR-amplified using pri- 
mers flanking with 40-bp sequences homologous to the 4 genome (5’-ACAG 
CCCGCCGGAACCGGTGGGCTTTTTTGTGGGGTGAAT-3’ and 5’-CCTGC 
GGCATATCACAAAACGATTACTCCATAACAGGGACA-3’), and electro- 
porated into a lysogen MC4100 (A CI857 Sam7)" to replace two non-essential 
genes (stf and ¢fa) in the prophage using a recombineering method’. The modi- 
fied 1 DNA was purified from the lysogen by Lofstrand Labs Limited. 
Labelling Aori with Qdot. Aori contains a 22-bp sequence (5’-CCTC 
AGCATAGATGTCCTCAGC-3’) approximately 4kb away from the SV40 ori- 
gin. This sequence has two sequential sites (5’-CCTCAGC-3’) that are recog- 
nized by nicking enzymes Nt.BbvCI and Nb.BbvCI. We used Nt.BbvCI or 
Nb.BbvCI, respectively, to modify either the top or bottom strands of Aori with 
a digoxigenin at a single base according to a previously established protocol’. 
To place a digoxigenin in the top strand, Aori was treated with Nt.BbvCI, mixed 
with 100-fold molar excess of oligonucleotide (5'-TCAGCAT“®AGATGTCC-3', 
Biosynthesis) containing a digoxigenin at T“®, heated to 60 °C, and slowly cooled 
down to room temperature. This procedure leads to melting of a 15-nucleotide 
region between the two Nt.BbvCI sites, which is then replaced by the digoxigenin- 
modified oligonucleotide. The modified oligonucleotide was then ligated to Aori 
using T4 DNA Ligase (NEB). The same procedure was performed to modify the 
bottom strand of Aori with digoxigenin except that DNA was nicked with 
Nb.BbvCI and annealed and ligated to 5’-TGAGGACAT“&CTATGC-3’. 
Attachment of anti-dig-conjugated Qdot605 (Invitrogen) to surface-immobilized 
ori was performed as described’. 

Preparation of DNA constructs. To produce biotin-modified linear DNA for use 
in ensemble unwinding assays, we cloned an insert (5’-CCTCAGCAGATA 
TCACCTCAGC-3’) between the EcoRI and Ndel sites of the pUC.HSO vector*® 
that places the insert approximately 150 nucleotides away from the SV40 origin. A 
518-bp fragment within the resulting vector was PCR-amplified using the fol- 
lowing oligonucleotides: forward: 5'-CCTCCAAAAAAGCCTCCTCACTA-3’, 
reverse: 5’-GTGCCACCTGACGTCTAAGAAACC-3’. 

The length of the DNA substrate was chosen such that dsDNA (Fig. 3B, lanes 1 
and 3), dsDNA bound to streptavidin (Fig. 3B, lanes 2 and 4), ssDNA (Fig. 3B, 
lanes 9 and 11), and ssDNA bound to streptavidin (Fig. 3B, lanes 10 and 12) ran at 
discrete positions on a 3% agarose gel. The PCR fragment contains the origin near 
one end and the 22-bp insert close to the centre (Fig. 3A). We first inserted an 
amine residue on the top or bottom strands using the same template exchange 
strategy as before. DNA was either nicked with Nt.BbvCI and annealed/ligated to 
5'-TGAGGTGAT*™™"ATCTGC-3’ (to modify the top strand), or nicked with 
Nb.BbvCI and annealed/ligated to 5’- TCAGCAGAT*™" ATCACC-3’ (to modify 
the bottom strand). Modified oligonucleotides were purchased from IDT that 
contained a C6-amine at T*””. To attach biotin to the amine-modified thymidine 
residue, amine-modified DNA (400ng) was mixed with NHS-PEG4-biotin 
(2.5 pl of 100 UM dissolved in dimethylsulphoxide, Pierce) in PBS buffer (25 pl 
final volume), and incubated overnight at room temperature. Excess NHS- 
PEG4-amine was removed by running DNA on a 1.5% agarose gel and sub- 
sequent gel extraction of DNA. This protocol generally resulted in half of the 
DNA molecules being modified with biotin at the specific site. To purify bioti- 
nylated DNA selectively, DNA was mixed with streptavidin (0.6 mg ml’ final), 
incubated for at least 2 h and separated on a 1.5% agarose gel. We then excised the 
band corresponding to the streptavidin-bound DNA that ran slower than free 
DNA. Purification of DNA from agarose gel was performed using a QIAquick gel 
extraction kit (Qiagen) unless stated otherwise. We note that streptavidin dis- 
sociated from biotin on DNA during the gel purification process, possibly owing 
to the denaturation of streptavidin by the ethanol wash of the spin column. 

To conjugate M.Hpall covalently to DNA, we first cloned 5’- CCTCAGCA 
TCCGGTACCTCAGC-3’ between the EcoRI and Ndel sites of the pUC.HSO (in 
which bold denotes the sequence recognized by M.Hpall). We then used the same 
primers as before to make a 518-bp PCR fragment containing the origin and 
the insert. The same nicking strategy described above was used to insert a 5- 
fluoro-2'-deoxycytidine (C™°)-modified oligonucleotide to either the top or bot- 
tom strands for subsequent covalent attachment of M.Hpall to DNA***’. The 


PCR fragment was either nicked with Nt.BbvCI and annealed/ligated to 5’- 
TCAGCATCC™°GGTACC-3' (to modify the top strand), or nicked with 
Nb.BbvCl and annealed/ligated to 5’-TGAGGTACC"™°GGATGC-3' (to modify 
the bottom strand). Custom-synthesized 5-fluoro-modified oligonucleotides 
were purchased from Biosynthesis. Modified DNA was gel-purified as described 
above and mixed with M.HpalI (1,000 U ml! final, NEB) in HpalI methylase 
reaction buffer (50mM Tris-HCl, pH7.5, 5mM 2-mercaptoethanol, 10 mM 
EDTA) supplemented with 100M S-adenosylmethionine. After 6-9h of 
incubation at 37°C, DNA was separated on a 1.5% agarose gel and the slow 
migrating band corresponding to M.Hpall-conjugated DNA was excised. To 
prevent denaturation of M.Hpall, DNA was purified by electroelution. In brief, 
the gel fragment containing M.HpalI-bound DNA was placed in a dialysis tube 
(3.5 kilodalton cut-off) filled with TBE buffer, and electrophoresis was used to run 
the DNA off the gel into the dialysis tube. The gel fragment was then removed 
from the tube, and DNA was dialysed against 10 mM Tris, pH 8, and concentrated 
using a spin concentrator (Ultrafree Biomax-10K, ThermoFisher Scientific). 

To generate DNA constructs containing two M.Hpall adducts, we first cloned 
two inserts of the same sequence (5’-CCTCAGCATCCGGTACCTCAGC-3’; in 
which bold denotes the sequence recognized by M.HpalI) into pUC.HSO that were 
separated by reported lengths. After making a PCR product of the desired length, 
the same protocol as above was used for insertion of 5-fluoro-2'-deoxycytidine- 
modified oligonucleotide and subsequent M.Hpall conjugation. 
Single-molecule DNA replication and unwinding assays. A microfluidic flow 
cell with a streptavidin-functionalized bottom surface was prepared as 
described’. Similar to the wild-type 2 DNA, Aori contains 12-nucleotide 
ssDNA at both ends. Aori was biotinylated and singly or doubly tethered to the 
surface using biotin-modified oligonucleotides complementary to these ends as 
described’*. For large-T-antigen-dependent replication of ori in HeLa cell 
extracts, the flow cell containing surface-immobilized Aori molecules was placed 
on a heat block (VWR) set to 37 °C. The flow cell was flushed with 50 ul large T 
antigen binding (TAB) buffer containing 30 mM Hepes, pH7.5, 7mM MgCl, 
0.5 mM dithiothreitol (DTT) and 50 pg ml ' BSA. Large T antigen (10 pg ml *) 
was introduced into the flow cell in TAB buffer supplemented with an ATP 
regeneration mixture (4mM ATP, 50 mM phosphocreatine and 10 ng pl * cre- 
atine phosphokinase), and 5ng ul‘ competitor DNA (pUC19) for origin-spe- 
cific assembly of large T antigen’®, and incubated for the indicated times. 

For large-T-antigen-dependent replication, a replication mixture was prepared 
by diluting 15 ul HeLa cell extract (Chimerx) twofold in buffer containing 30 mM 
Hepes, pH 7.5, 7mM MgCh, 0.5mM DTT, 50 pg ml! BSA, 4mM ATP, 50mM 
phosphocreatine, 10 1.gml~! creatine phosphokinase, 0.1mM each of dATP, 
dGTP, dTTP and dCTP, and 50 tM each of GTP, TTP and CTP (all concentra- 
tions final). The replication mixture was drawn into the flow cell and incubated 
for the indicated times. To confirm bidirectional replication, a second replication 
mixture containing 1.7 1M dig-dUTP was drawn into the flow cell and further 
incubated. Competitor DNA (pUC19; 5 ng til’) was included to suppress non- 
specific interaction of proteins in the extract with dori that interferes with fork 
progression (data not shown). To stop the reaction, the flow cell was washed 
with buffer containing 20mM Tris, pH7.5, 50mM NaCl, 12mM EDTA and 
0.1% SDS. 

To visualize unwinding of Aori by large T antigen in real time, large T antigen 
was first assembled on doubly tethered ori as described above. RPA™ CR 
(15 nM) was then drawn into the flow cell in TAB supplemented with the ATP 
regeneration mixture. The flow cell was heated to 37 °C on a microscope stage 
with an objective heater (Bioptechs) during large T antigen assembly and sub- 
sequent unwinding reactions. Imaging of RPA™ “C8 is described below. 
Ensemble DNA unwinding assays. PCR fragments were labelled either inter- 
nally with [o°?P]-dATP (Perkin Elmer) during the PCR reaction, or at their 5’ 
ends using T4 polynucleotide kinase (NEB) and [y°?P]-ATP (Perkin Elmer). 
Unwinding reactions were performed with 0.3-0.5 ng pl‘ labelled DNA, 10 jig 
ml ' large T antigen, 40 1g ml’ RPA and the ATP regeneration mixture in TAB 
and incubated for the indicated amount of time at 37 °C. Unlabelled competitor 
DNA (Fspl linearized pUC19; 5 ng pl *) was included in unwinding reactions to 
ensure origin-specific unwinding. The reaction was stopped with 0.5% SDS and 
20mM EDTA, DNA was separated on a 3% agarose gel, and visualized using a 
phosphorimager. Quantification of band intensities was carried out using 
Quantity One software. 

To prepare 5’-biotin-modified DNA for bead-immobilization, one of the pri- 
mers used in the PCR reaction contained biotin at the 5’ end. Fifteen nanograms 
of DNA was mixed with streptavidin-coated magnetic beads (Invitrogen) in 
10 mM Tris supplemented with 0.2 mg ml’ BSA. Beads were washed twice using 
a magnet, resuspended in TAB buffer containing the ATP regeneration mixture 
and 201g ml! large T antigen, and incubated for 2.5h at 37°C rotating on a 
rotisserie. Beads were washed twice with TAB buffer containing 0.6 mM ATP, 
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resuspended in TAB buffer supplemented with the ATP regeneration mixture, 
40 ugml~' RPA, 50 uM biotin, 15ng cold DNA containing the origin, and 
incubated for 30 min at 37 °C before stopping the reaction. 

Single-molecule imaging. Total internal reflection fluorescence imaging was 
carried out on an inverted microscope (Olympus [X-71) using a X60 oil objective 
(PlanApo, numerical aperture = 1.45, Olympus). SYTOX and fluorescein-anti- 
dig images on surface-immobilized DNA after replication in HeLa extracts were 


recorded as described”*. Live monitoring of RP 


A™RSEGR was performed by cap- 


turing an image every minute using 488-nm laser light. 
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Structure of the TatC core of the 
twin-arginine protein transport system 


Sarah E. Rollauer’?, Michael J. Tarry°t, James E. Graham’*¥, Mari Jaaskeldinen®, Franziska J ager’, Steven Johnson!, 
Martin Krehenbrink’, Sai-Man Liu’?, Michael J. Lukey"+, Julien Marcoux®, Melanie A. McDowell’, Fernanda Rodriguez”, _ 
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The twin-arginine translocation (Tat) pathway is one of two general protein transport systems found in the prokaryotic 
cytoplasmic membrane and is conserved in the thylakoid membrane of plant chloroplasts. The defining, and highly 
unusual, property of the Tat pathway is that it transports folded proteins, a task that must be achieved without allowing 
appreciable ion leakage across the membrane. The integral membrane TatC protein is the central component of the Tat 
pathway. TatC captures substrate proteins by binding their signal peptides. TatC then recruits TatA family proteins to 
form the active translocation complex. Here we report the crystal structure of TatC from the hyperthermophilic 
bacterium Aquifex aeolicus. This structure provides a molecular description of the core of the Tat translocation 
system and a framework for understanding the unique Tat transport mechanism. 


The Sec’ and Tat** protein transport pathways are responsible for 
protein export across the cytoplasmic membrane of bacteria and 
archaea, and for protein import into the thylakoids of choloroplasts. 
The Sec system threads unfolded proteins through a membrane- 
bound channel’. By contrast, the Tat pathway transports proteins 
that have already attained a folded state’, a more challenging task 
because folded proteins have a much larger cross-section than an 
unfolded peptide and so require a larger transport pathway. In addi- 
tion, folded proteins have a diverse range of shapes and sizes, making 
it difficult to seal tightly around the protein during transport to 
preserve the membrane permeability barrier. How the Tat apparatus 
is able to accomplish this feat is unknown. In bacteria the Tat path- 
way has an important role in energy metabolism, nutrient acquisition 
and virulence**®, whereas in plants the Tat pathway is essential for 
photosynthesis’. 

Tat transport is carried out by the integral membrane proteins 
TatA*°, TatB® and TatC’®. TatA and TatB are structurally related and 
contain a single transmembrane helix followed by an amphipathic 
helix*"’, whereas TatC has multiple transmembrane helices’. Proteins 
are targeted to the Tat pathway by amino-terminal signal peptides 
possessing a twin-arginine-containing sequence motif'* “* that is recog- 
nized by the TatC protein within a TatBC complex’*”’. Binding of 
substrate proteins to the TatBC complex induces recruitment of TatA 
to assemble a transient TatABC-containing translocation site'*”°”', 
which is thought to facilitate transport by perturbing the membrane 
bilayer**”. 

TatC emerges as the core organizing component of the Tat pathway, 
directly and dynamically binding substrate, TatB and TatA”’, and 
maintaining these interactions during the transport step’®*’. A 
molecular-level understanding of TatC, the largest and most conserved 
element of the Tat translocation machinery, is required to begin to 
understand the remarkable Tat transport mechanism. 


Structure of TatC 


We targeted TatC proteins from thermophilic bacteria and archaea for 
structure determination. Aquifex aeolicus TatC solubilized in the deter- 
gent lauryl maltose neopentyl glycol (LMNG)* formed crystals that dif- 
fracted to high resolution. LMNG-solubilized A. aeolicus TatC was able to 
bind a synthetic Tat signal peptide (dissociation constant (Kg) = 3 1M) at 
an approximately 1:1 protein:peptide ratio (Fig. 1a), but not a functionally 
inactive variant signal peptide’*'®’° (Fig. 1b). Thus, the protein used for 
crystallization was functional for Tat signal peptide binding. Identical 
binding affinities were measured for the full-length signal peptide and 
for a truncated signal peptide comprising only the twin-arginine-con- 
taining n-region (Fig. 1c), consistent with crosslinking studies which 
indicate that TatC contacts the n-region of the signal peptide’*’’. 

Crystals of selenomethionine-labelled A. aeolicus TatC (Supplementary 
Fig. 1) were produced and the structure of the protein solved at a resolu- 
tion of 3.5 A using anomalous dispersion (Supplementary Fig. 2a, b). The 
first four and last eleven amino acids of A. aeolicus TatC are disordered 
and are not included in the structure (Supplementary Fig. 3). The final 
refined structure has an Rye. of 28.8% (Supplementary Table 1). The A. 
aeolicus TatC fold is well-maintained in atomistic molecular dynamics 
(MD) simulations in a phospholipid bilayer, indicating that the structure 
would be stable in a membrane environment (Supplementary Fig. 4). A. 
aeolicus TatC shares 40% sequence identity with Escherichia coli TatC 
(Supplementary Fig. 3), allowing functional data from the E. coli Tat 
system to be readily mapped to the A. aeolicus TatC structure. 

A. aeolicus TatC (Fig. 2a) contains six transmembrane helices 
(TM1-TM6), as anticipated’?”**, which experimental evidence 
shows are oriented with the amino and carboxy termini of the protein 
at the cytoplasmic side of the membrane*”~*®. MD simulations of A. 
aeolicus TatC in a phospholipid environment indicate that little of the 
structure protrudes outside the membrane bilayer (Fig. 2a and 
Supplementary Fig. 5). Three helices (TM2, TM3 and TM5) are 
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Figure 1 | Purified A. aeolicus TatC binds Tat signal peptides. 

a-c, Isothermal titration calorimetry was used to assess interactions between A. 
aeolicus TatC and the Tat signal peptide of the E. coli protein Sufl (a), a 
functionally inactive variant Sufl signal peptide in which the twin arginine 
residues have been substituted with twin lysine residues (b) and the n-domain 
of the Sufl signal peptide (c). d, Interaction between the n-domain peptide and 
an A. aeolicus TatC E96A variant. 


Figure 2 | Structure of A. aeolicus TatC. a, Cartoon representation of A. 
aeolicus TatC coloured from blue at the N terminus to red at the C terminus. 
The position within the membrane bilayer is predicted by MD simulations 
(Supplementary Fig. 5). A semi-ordered molecule of LMNG detergent is present 
and is shown in purple. b, Surface representation of the A. aeolicus TatC 
structure. The left hand views in a and b are the same orientation. c, Positions of 
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strongly kinked within their transmembrane spans, whereas TM1 
kinks abruptly towards the periplasmic side of the membrane to form 
an interfacial helix (helix 1b). The six transmembrane helices, with the 
exception of the C-terminal half of TM2 following the kink, are tilted 
at angles between 20° and 40° to the membrane normal. TM5 and 
TM6are too short to span the membrane bilayer fully. Indeed, in MD 
simulations the membrane is distorted inwards at the ends of these 
helices (Supplementary Fig. 5). The restricted length of these helices is 
probably required, in part, to accommodate the TatB partner (see 
below) but it is also possible that the bilayer distortion caused by 
TM5 and TM6 may assist in forming the protein translocation path- 
way. The loops joining the transmembrane helices on the cytoplasmic 
face of A. aeolicus TatC are short. However, at the periplasmic face of 
the membrane the loops are more elaborate, with the interfacial helix 
1b and succeeding loop into TM2 making extensive interactions with 
the TM3-TM4 loop. The result is a structured periplasmic cap that 
stabilizes the relative positions and tilted orientations of the trans- 
membrane helices. The presence of the cap structure explains why 
insertions and deletions in this region are not tolerated*'’* and why 
these loops are hotspots for inactivating mutations” (Fig. 2c). 

The overall shape of A. aeolicus TatC resembles a cupped hand in which 
the transmembrane helices form a curved wall overhung on the concave 
face by the periplasmic cap (Fig. 2b). The result is a structure with a notably 
high surface area (~16,500 A”) for the length of protein. A gap between 
the short TM5 and TM6 helices and the cap results in a prominent groove 
leading from the concave face of A. aeolicus TatC to the periplasm. The 
A. aeolicus TatC fold shows no similarity to previously described mem- 
brane protein structures. Importantly, the structure of TatC is unrelated 
to that of the Sec translocase even though the two proteins bind 
signal peptides of similar overall structure’*. Crystallographic B-factors 
suggest that A. aeolicus TatC has very restricted structural flexibility 
(Supplementary Fig. 6a). Similarly, MD simulations of A. aeolicus TatC 


inactivating substitutions in E. coli TatC identified by genetic screening” (green), 


17,34 ( 


targeted substitution'”** (orange), or both (yellow), mapped to A. aeolicus TatC 
(full details in Supplementary Table 2). d, Sequence conservation at the surface of 
TatC displayed in the Consurf"* colouring scheme in which maroon represents 
high, and cyan low, sequence conservation. ¢, d, Dotted regions labelled (1) and 
(2) highlight conserved functional regions of the structure. 
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in a phospholipid bilayer show limited deformation modes (Supplemen- 
tary Fig. 6b). It is, therefore, unlikely that TatC undergoes large conforma- 
tional changes as typically found in small-molecule transporters”. 

A notable feature of the surface of A. aeolicus TatC is that the side chain 
of Glu 165 is exposed at the centre of the concave face, thereby placing an 
ionisable group in the hydrophobic interior of the bilayer (Fig. 3a). This 
residue is highly conserved as glutamate or glutamine. Substitution of the 
equivalent residue in E. coli TatC (Glu 170) with alanine does not affect 
signal peptide binding”, but severely compromises overall Tat transport 
activity'’**. In MD simulations Glu 165 is hydrated and perturbs the local 
bilayer environment by attracting a lipid head group. This suggests that 
burying Glu 165 in the membrane is highly unfavourable and that 
Glu 165 has an interaction partner in the cell. 

Tat transport is energized by the proton electrochemical gradient”. 
However, no membrane surface-linked channels or transmembrane 
hydrogen bonding networks are apparent within A. aeolicus TatC. 
Glu 165 is part of a short hydrogen-bonding network with Ser 107 and 
Trp 85 (Supplementary Fig. 7) but this is isolated from the membrane 
surface. Thus, if TatC participates in coupling transport to transmembrane 
proton movement, it does so in conjunction with other components. 

To highlight functionally important regions of TatC we plotted amino 
acid substitutions that suppress the transport function of E. coli TatC (refs 
17, 32, 34) on to the equivalent positions in the A. aeolicus TatC structure 
(Fig. 2c). We also mapped sequence conservation on to the protein surface 
(Fig. 2d). There are two clusters of conserved and functionally important 
residues on the surface of TatC (labelled (1) and (2) in Fig. 2c, d). These are 
located at each end of the TatC molecule, on opposite sides of the mem- 
brane, and probably correspond to sites of interactions with partner pro- 
teins. A sequence covariance analysis shows that the strongest structural 
constraints in TatC are within those helix pairs (TM1/TM2 and TM5/ 
TM6) that form the structural framework underlying the two functionally 
conserved surface sites (Supplementary Fig. 8 and Supplementary Table 
3). Functionally important residues in the interior of the TatC molecule 
have structural roles mediating transmembrane kinks (conserved) or 
stabilizing the periplasmic cap (weakly conserved). 


Signal peptide recognition 
Residues in E. coli TatC that may be involved in signal peptide binding 
have been identified by crosslinking and genetic analysis’?"*°?°”. 


c 
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The equivalent residues in A. aeolicus TatC cluster on the cytoplasmic 
face of TM1-TM3 within conserved functional surface site (1) 
(Fig. 3b). Within this site the two negatively charged amino acids 
Glu9 and Glu 96 would be appropriately positioned to coordinate 
the positively charged guanidinium groups of the sequential arginine 
residues of the signal peptide. 

To directly test the involvement of this part of A. aeolicus TatC in 
signal peptide interactions we measured the signal peptide binding 
affinities of single amino acid substitutions within the site (Figs 1d and 
3c, and Supplementary Fig. 9). All the substitutions, including those 
of Glu9 and Glu 96, resulted in large reductions in the affinity of 
A. aeolicus TatC for the n-region peptide. By contrast, changing a 
functionally important** residue in surface site (2) had no effect on 
signal peptide binding (Fig. 3c and Supplementary Fig. 9). This ana- 
lysis confirms that the twin-arginine-containing region of Tat signal 
peptides bind to surface site (1). 


Interactions with Tat components 


In E. coli, and in plant thylakoids, multiple copies of TatC are pre- 
sent in the TatBC complex******”° and TatC has been reported to 
self-interact*’, possibly as local TatC dimers*’. We considered whether 
the protomer arrangement in the A. aeolicus TatC crystal might reflect 
such interactions. One of two packing interactions in the crystals 
(between molecules X and X’’ in Fig. 4a) forms a topologically rea- 
sonable dimer (752 A X2 buried surface area; Supplementary Fig. 10), 
and biochemical characterization confirms that the LMNG-solubilized 
protein is a dimer in solution (Supplementary Fig. 11). Nevertheless, 
the low sequence conservation of the subunit interface (Fig. 2d and 
Supplementary Fig. 10), the results of sequence covariance analysis 
(Supplementary Fig. 8 and Supplementary Table 3), and inconsistency 
with site-specific photocrosslinking results (Fig. 4b)** all argue against 
the biological relevance of this dimer. More generally, the lack of 
plausible inter-subunit contacts in the covariance analysis suggests that 
interactions between TatC protomers are not subject to specific struc- 
tural or mechanistic constraints. 

In Gram-negative bacteria and plants TatC forms a tight com- 
plex with TatB. Known contact information for the two proteins in 
E. coli”**°* maps to the periplasmic end of conserved functional 
surface patch (2) in TatC (Fig. 4b) and to the periplasmic” N terminus 


Figure 3 | Identification of the signal peptide 
(a Sn binding site on TatC. a, The electrostatic surface 
of TatC showing negatively charged regions as red 
and positively charged regions as blue. b, E. coli 
TatC residues implicated in signal peptide or 
4 substrate interactions mapped to A. aeolicus TatC 
(full details in Supplementary Table 4). 
3 Substitutions that prevent signal peptide 
) crosslinking to E. coli TatC (ref. 17) are shown in 
) purple. Positions that give prominent photo- 
affinity crosslinks to substrate proteins”° are shown 
’ in green. Single amino acid substitutions that 
enable transport of substrates with defects in the 
signal peptide twin-arginine motif"*!°*” are shown 
in cyan. c, Effects of the indicated amino acid 
substitutions on the affinity of A. aeolicus TatC for 
the Sufl n-region peptide. Substitutions that affect 
signal peptide binding are coloured red or pink, 
and those that do not are coloured green. The 
dh * corresponding experimental data are shown in 
be Fig. 1d and Supplementary Fig. 9. The dotted 
# region in a-c corresponds to region (1) in Fig. 2c, d. 
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of the TatB transmembrane helix* (TatB(TM)). To delineate further 
the proposed TatB-TatC contact site we undertook limited disulphide 
scanning mutagenesis of the E. coli TatB and TatC proteins in these 
areas (Fig. 4c). In the A. aeolicus TatC crystals the contact between 
molecules X and X’ involves packing an inverted TM5 from one TatC 
molecule against the proposed TatB contact site in the other (Fig. 4a), 
showing that a transmembrane helix with the same orientation as 
TatB(TM) (N terminus at the periplasmic side of the membrane) is 
able to pack at this site. On the basis of this crystallographic contact we 
produced a model for the TatB(ITM)-TatC complex consistent with 
the experimental interaction data (Fig. 4d). A notable feature of the 
model is that the C-terminal half of TMS is well matched in length to 
the short TatB(TM) to which it binds. 

Although our analyses localize the signal peptide and TatB binding 
sites to opposite ends of the TatC molecule, some evidence exists for 
functional and structural connections between TatB and signal peptides. 
First, photocrosslinks to TatB have been reported in the vicinity of the 
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Figure 4 | Sites of interaction with other Tat components. a, A. aeolicus TatC 
crystal contacts. The contact between molecules X and X’ involves a packing 
interaction between antiparallel TM5 helices, whereas that between molecules 
X and X”’ involves the semi-ordered LMNG molecule shown in purple. 

b, Positions in E. coli TatC that have been reported****** to interact with both 
TatA and TatB (green), with TatB (magenta), or with TatC (orange) mapped on 
to A. aeolicus TatC (full details in Supplementary Table 5). The dotted region 
corresponds to region (2) in Fig. 2c, d. c, Disulphide crosslinking between E. coli 
TatB and TatC variants detected by immunoblotting with TatB antibodies. 
Lanes are untreated (C), oxidized with Cu(11)phenanthroline (O), or oxidized 
and then reduced with DTT (R). d, The complex between TatC and the 
transmembrane helix of TatB modelled on the A. aeolicus TatC X-X’ crystal 
contact. Positions in E. coli TatC that form a disulphide bond to a E. coli TatB 
L10C variant (yellow), or positions in E. coli TatB that forma disulphide bond to 
a E. coli TatC M205C variant (blue), (data from c) are mapped on to the model. 
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Figure 5 | A conceptual model for the environment of TatC in the substrate- 
bound translocation site. The model is viewed from the convex (left) and 
concave (right) sides of TatC. The interaction of TatC with the transmembrane 
helix of TatB is modelled as in Fig. 4d. The TatB amphipathic helices are positioned 
to allow TatB to be in the vicinity of the N terminus of TatC as shown by 
crosslinking®*. The TatA transmembrane helix is shown bound to the concave face 
of TatC based on in silico docking experiments (Supplementary Fig. 12). The 
position of the TatA amphipathic helix is unclear and is omitted from the 
conceptual model. TatA forms large polymers but only representative TatA 
molecules directly contacting TatC are shown. For simplicity, the model depicts a 
single TatC molecule but each TatC protein will be operating in the context of a 
larger TatBC complex. 


signal peptide binding site in TatC** (Fig. 4b). Second, the signal peptide 
h-region, which directly follows the n-region, photocrosslinks predomi- 
nantly to TatB!>"®. Third, some substitutions that allow transport of 
substrates with defective Tat signal peptides map to the N terminus of 
TatB(TM)'**’. These data suggest that the amphipathic helix of TatB 
could run across the cytoplasmic face of TatC from the TatB(TM) 
binding site at TM5 to the signal peptide binding site at TM1 (Fig. 5). 
Alternatively, some of these interactions may relate to a stage in the 
transport cycle where the signal peptide is reaching across the membrane 
from the point of attachment of the twin-arginine motif at region (1) to 
the TatB(TM)-binding site in region (2). It is also possible that it is a TatB 
molecule bound to a neighbouring TatC molecule within the TatBC 
complex that is close to the signal peptide binding site. 

The interaction of TatA with TatC is transient and may involve 
multiple TatA protomers binding to a single TatC protein. The only 
identified points of interaction between the two proteins are photo- 
crosslinks*® at the periplasmic side of the membrane close to the 
modelled position of the TatB N terminus (Fig. 4b). This suggests 
that the periplasmically located N terminus” of TatA can reach this 
region. In the absence of more detailed experimental information 
on the TatA binding site(s) we carried out in silico docking experi- 
ments and MD simulations using a homology model of the A. aeolicus 
TatA transmembrane helix (TatA(TM)). The results suggest that 
TatA(TM) binds within the concave face of TatC (Supplementary 
Fig. 12). This location might allow TatA to act as the interaction 
partner for exposed Glu 165, perhaps via interaction with the func- 
tionally essential conserved Gln/Glu in TatA(TM)*. The relatively 
large and concave surface of this region of TatC could promote a 
binding-induced conformational change in the bound TatA proto- 
mers that would allow them to act as the nucleation site for TatA 
polymerization. 

The conceptual model shown in Fig. 5 summarizes the possible 
arrangement of the components of an active Tat translocation site 
indicated by the A. aeolicus TatC structure. 


METHODS SUMMARY 


Native and SeMet A. aeolicus TatC were heterologously expressed in E. coli as GFP 
fusions“ and purified in LMNG. After GFP removal the protein was crystallized by 
the sitting-drop vapour diffusion method at 21°C in 0.2 M ammonium sulphate, 
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0.02 M NaCl, 0.02 M sodium acetate pH 4.0, 33% v/v PEG 200. Diffraction data were 
collected at the Diamond light source (Beamline 104-1) at 120K from four SeMet- 
labelled crystals frozen directly in mother liquor (A = 0.92 A). The crystals diffracted 
to 3.5 A and belonged to the space group H32. The data were phased using single- 
wavelength anomalous dispersion. The final structure has a Molprobity*” Score of 3.1 
(84th percentile for structures 3.5 A+0.25 A). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Selection of clones, growth conditions and solubilizing detergent. A range of 
tatC genes from thermophilic bacteria and archaea were cloned into plasmid 
pWaldo-TEV-GFPe™ to allow expression in E. coli as N-terminal fusions to 
octahistidine-tagged GFP (TatC-GFP-8His). The resulting fusion proteins con- 
tain a TEV protease-cleavable linker between the TatC and GFP domains. The 
clones were cultured in a variety of background strains and growth conditions 
and TatC-GFP expression assessed by whole-cell fluorescence***’. This screen 
identified Aquifex aeolicus TatC as suitable for high-level protein production. The 
A. aeolicus tatC gene* contains two possible initiation codons. Although sequence 
comparisons between TatC proteins from within the phylum Aquificae indicate 
that the first of these codons is the authentic translation start site, the construct 
used here starts from the second methionine codon, thereby removing eight poorly 
conserved amino acids (sequence MSQEKLPE) from the N terminus of the pro- 
tein. Fluorescence size exclusion chromatography (FSEC)” was used to identify 
detergents that were able to solubilize the A. aeolicus TatC-GFP fusion protein ina 
monodisperse state. Subsequently size exclusion chromatography-multi-angle 
laser light scattering (SEC-MALLS) was used to assess whether these detergents 
were able to maintain TatC in a monodisperse state following proteolytic removal 
of the GFP domain. Detergents meeting this selection criterion were used to purify 
A. aeolicus TatC for crystallization trials. 

Protein purification and crystallization. The A. aeolicus TatC-GFP construct 
was recombinantly expressed in E. coli Lemo56(DE3)” cells cultured at 37 °C in 
Terrific Broth supplemented with 50 tg ml” ' kanamycin. When the cells attained 
OD¢00 = 5.0, 0.1 mM IPTG was added and the cells cultured for a further 16h at 
24°C. Cells were disrupted by homogenization (Emulsifex C5, Avestin) and 
membranes recovered by differential centrifugation. Solubilization was carried 
out at 4 °C for 2 h in 1% w/v lauryl maltose neopentyl glycol (LMNG; Anatrace) in 
PBS containing 20 mM imidazole. The soluble fraction was loaded on to a 5 ml 
Ni-NTA column (Qiagen). The column was washed with five column volumes of 
PBS containing 0.02% LMNG and 25mM imidazole, then with two column 
volumes of the same buffer containing 40 mM imidazole, and eluted with the 
same buffer containing 250 mM imidazole. TatC-GFP-containing fractions were 
pooled and incubated with 2 mg His-tagged TEV protease, 5 mM sodium citrate 
pH6.3, 0.3mM oxidized glutathione and 3 mM reduced glutathione (final con- 
centrations). This mixture was dialysed in a 10-kDa molecular mass cutoff 
membrane (‘snakeskin tubing’, Fisher scientific) against size exclusion (SEC) 
buffer (20 mM HEPES pH 7.5, 150mM NaCl, 0.02% w/v LMNG) at 4°C over- 
night. Passage over a 5ml Ni-NTA column removed the His-tagged GFP and 
His-tagged TEV protease. TatC was concentrated in a centrifugal concentrator 
device (100 kDa molecular mass cutoff membrane, Millipore) to 5 ml, and loaded 
on to a Superdex 200 16/60 gel filtration column (GE Healthcare) pre-equilibrated 
in SEC buffer. Final peak fractions were concentrated to ~8 mg ml’. 

Selenomethionine-substituted TatC was expressed in E. coli L56 cells grown in 
M9 minimal medium, using the feedback inhibition of methionine biosynthesis 
method*®. Cells were grown at 37 °C to an ODg¢o9 = 0.7 and induced as for the 
native protein. Purification was carried out as for the native protein but the 
final size exclusion buffer was supplemented with 5mM f-mercaptoethanol. 
Full incorporation of selenomethionine was confirmed by mass spectrometry 
(Supplementary Fig. 1). 

Crystallization was carried out using the sitting-drop vapour diffusion method 
at 21°C with a drop size of 200nl. The best crystals were obtained in 0.2M 
ammonium sulphate, 0.02 M NaCl, 0.02M sodium acetate pH 4.0, 33% v/v 
PEG 200 at a 50:50 ratio of protein:mother liquor and a drop volume of 400 nl. 
Crystals of the selenomethionine protein were obtained in the same condition. 
These were flash frozen in liquid nitrogen in the mother liquor, diffracted to 3.5 A, 
and belonged to the space group H32. 

Data collection, structure determination and refinement. Diffraction data were 
collected at the Diamond light source (Beamline 104-1), at 120K from four SeMet- 
labelled crystals (A = 0.92 A). The data were processed using xia2°' and data sets 
were scaled together using autoSHARP™. The first two selenium sites were located 
using Phenix Hyss*’, and the final two positions identified in SHARP LLG gradient 
maps™. The hand was determined by SOLOMON” solvent flattening (CC for 
observed versus solvent flattened Es was 45% versus 51%). The main chain was 
traced by hand as a continuous stretch of 220 residues in the experimental map 
with the SeMet sites allowing unambiguous determination of the chain direction 
and docking of sequence in the C-terminal portion of the protein. After initial 
refinement in autoBUSTER®™, Buccaneer” was used to autobuild a model which 
had the correct sequence throughout. Further rounds of building using COOT** 
and refinement in autoBUSTER followed to give a final model for residues 5-229 
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with R = 25.2% and Rfree = 28.8%. 92% of residues lie in the favoured regions of 
the Ramachandran plot and the structure has a Molprobity*’ score of 3.1 (84th 
percentile for structures 3.5 A+ 0.25 A). The quality of the electron density map is 
shown in Supplementary Fig. 2. There is one TatC protomer in the asymmetric 
unit and one molecule of LMNG located on a crystallographic two-fold (occu- 
pancy of all LMNG atoms set to 0.5). 

The coordinates and experimental data have been deposited at the Protein Data 
Bank with accession code 4b4a. 

Images for publication were generated using PyMol Version 1.5.0.4 (http:// 
www.pymol.org; Schrodinger, LLC). 
Protein analysis. Isothermal titration calorimetry was carried out using an iTC299 
isothermal titration calorimeter (Microcal) using synthetic peptides (Generon 
Ltd) as the titrant. The n-region peptide was amidated on the C terminus. 
Titrations were carried out at 25 °C using a reference power of 3 cals‘ ata stirrer 
speed of 1,000 r.p.m. Isotherms were analysed and fitted in SEDPHAT” using a 
hetero-association model (A + B <> AB) with 4H and Kg as fitting parameters. 
To allow for inaccuracies in peptide concentrations an inactive peptide fraction 
was also included as a fitting parameter. Concentrations of TatC were measured 
by absorption at 280 nm in 6 M guanidinium chloride using an extinction coef- 
ficient of 24,870 M! cm! calculated from the sequence. 

Disulphide-crosslinking experiments were carried out at 24°C as described 
previously*”’. 
Molecular dynamics simulations and in silico docking. All MD simulations 
were performed using GROMACS v4.5.4 at 323 K®. Coarse-grain MD simula- 
tions were run for 1 jts to permit the assembly and equilibration of a dipalmi- 
toylphosphatidylcholine (DPPC) bilayer around the protein®’ using a Martini 
force field (Supplementary Fig. 5). The systems were then converted to atomistic 
using the CG2AT-align method”. The atomistic systems used the Gromos 53a6 
force field and were equilibrated for 1 ns with the protein restrained, before 100 ns 
of unrestrained atomistic MD. 
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The clinical efficacy and safety of a drug is determined by its activity profile across many proteins in the proteome. 
However, designing drugs with a specific multi-target profile is both complex and difficult. Therefore methods to design 
drugs rationally a priori against profiles of several proteins would have immense value in drug discovery. Here we 
describe a new approach for the automated design of ligands against profiles of multiple drug targets. The method is 
demonstrated by the evolution of an approved acetylcholinesterase inhibitor drug into brain-penetrable ligands with 
either specific polypharmacology or exquisite selectivity profiles for G-protein-coupled receptors. Overall, 800 ligand- 
target predictions of prospectively designed ligands were tested experimentally, of which 75°% were confirmed to be 
correct. We also demonstrate target engagement in vivo. The approach can be a useful source of drug leads when 
multi-target profiles are required to achieve either selectivity over other drug targets or a desired polypharmacology. 


The safety and efficacy ofa drug is determined not only by its action on an 
individual protein, but also by its interactions with many proteins in the 
proteome. The promiscuous interaction of a drug with undesired pro- 
teins frequently causes toxicity’ and adverse effects**. Conversely, a single 
drug target can be therapeutically insufficient, particularly in complex 
neuropsychiatric conditions, infectious diseases or cancer**. Instead, it is 
frequently necessary for a drug to engage two or more targets simulta- 
neously for therapeutic efficacy’. Psychiatric drugs in particular require 
multiple activities against several targets to modulate therapeutically 
complex neuropsychiatric domains including perception, cognition 
and emotion*. However, designing drugs with a specific multi-target 
profile—to achieve either exquisite selectivity over other drug targets or 
a desired polypharmacology—is a complex and exceedingly difficult task 
for medicinal chemistry’. Accordingly, methods are needed to enable 
drugs to be designed a priori against several molecular targets simulta- 
neously. Here we describe a solution to the complex problem of designing 
ligands against multiple drug target profiles by automated design. 


From prediction to design 

The problem of designing ligands against a multi-target profile involves 
the parallel optimization of multiple structure-activity relationships 
within a desired range of physicochemical properties. The prospect of 
multi-target drug design has been recently aided by the development of 
computational methods that show success in predicting the molecular 
targets of drugs*’-** (Supplementary Fig. 1), although such approaches 
are not intrinsically design methods. 

Drug design can be modelled as an evolutionary process of iterative 
cycles of exploration and analysis’*’. Adaptive design processes are 
efficient at solving complex, multi-objective problems. Accordingly, 
we developed an automated, adaptive design approach to optimize 
ligands against polypharmacological profiles. 


Several de novo drug design methods have been proposed previ- 
ously’® *!, However, of those that have been experimentally tested***, 
high affinity ligands have been described only rarely and these are all 
against a single molecular target objective**”®. In contrast to previous 
de novo approaches, we mimicked the creative process by automated 
learning of medicinal chemistry design tactics, applying these auto- 
mated learning approaches to the generation of analogues, and then 
prioritizing them relative to a set of objectives (Fig. 1a). The develop- 
ment of this approach is described below, starting from ‘off-target’ 
predictions, progressing to ligand design, and finally to the discovery 
of novel compounds with predefined multi-target profiles. 


Evolution of a drug 


James Black proposed that, “the most fruitful basis for the discovery of a 
new drug is to start with an old drug””’. Accordingly, we tested whether 
the algorithm could automate the evolution of new biological activities, 
starting from a known drug. Donepezil (compound 1) is an acetylcho- 
linesterase inhibitor approved for cognitive enhancement in Alzheimer’s 
disease. Bayesian probabilistic activity models’, for 784 molecular targets 
built from the ChEMBL database”, predicted a moderate likelihood that 
donepezil possessed D4 dopamine receptor activity and a low chance of 
D2 dopamine receptor activity (Supplementary Table 1). We found 
donepezil was a moderately potent D4 inverse agonist (inhibition con- 
stant K, = 614 nM) with minimal D2 activity (Supplementary Figs 2 and 
3 and Supplementary Tables 2 and 3). The D4 inverse agonist activity of 
donepezil is intriguing given analyses demonstrating a significant 
improvement in memory in the trail making test with this drug” and 
findings that D4 antagonists can prevent stress-induced cognitive dys- 
function in primates”. 

We tested our method by evolving the structure of donepezil with 
the dual objectives of improving D2 activity and achieving blood-brain 
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Figure 1 | Adaptive drug design. a, Closed loop of automated ligand design 
algorithm by multi-objective evolutionary optimization. b, Multi-objective 
prioritization by vector scalarization. The multi-target objectives are defined as 
the coordinates of the ideal achievement point, O (gold cross), and the predicted 
values of each generated compound (coloured circles) are also defined as 
coordinates in a multi-dimensional space. The Pareto frontier is displayed as a red 
dotted line. The multi-objective prioritization is inverse to the magnitude of 
vectors (||a|| < ||b|| < ||c||). Compound A is prioritized the highest. 
Compounds C and D have the same vector length (||c|| = ||d||) and are thus 
prioritized equally and above the Pareto optimal compounds E and F. 

c, Evolution of donepezil (compound 1) (19% inhibition of D2 receptor at 10 1M; 
D2 dopamine Bayesian score = 25) into D2 dopamine inverse agonist compound 
3 (96% inhibition of D2 receptor at 10 1M; D2 dopamine Bayesian score = 92). 
The Tanimoto similarity between donepezil and compound 3 is only 0.35. 


barrier penetration. In our approach the desired multi-objective profile 
is defined a priori and then expressed as a point in multi-dimensional 
space termed ‘the ideal achievement point’. In this first example the 
objectives were simply defined as two target properties and therefore 
the space has two dimensions. Each dimension is defined by a Bayesian 
score for the predicted activity and a combined score that describes the 
absorption, distribution, metabolism and excretion (ADME) pro- 
perties suitable for blood-brain barrier penetration (D2 score = 100, 
ADME score = 50). We then generated alternative chemical structures 
by a set of structural transformations using donepezil as the starting 
structure. The population was subsequently enumerated by applying a 
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set of transformations to the parent compound(s) of each generation. 
In contrast to rules-based or synthetic-reaction-based approaches for 
generating chemical structures’**’~*, we used a knowledge-based 
approach by mining the medicinal chemistry literature**”’. By deriving 
structural transformations from medicinal chemistry, we attempted to 
mimic the creative design process** (Supplementary Fig. 4). Activity 
predictions were calculated for each of the enumerated compounds 
from all Bayesian models. Scores representing the likelihood of central 
nervous system (CNS) penetration and good ADME properties were 
calculated using the program StarDrop and combined into a single 
value. The predicted properties of the enumerated structures were then 
expressed as points in multi-dimensional space. The generated struc- 
tures were subsequently ranked by the distance (in multi-dimensional 
space) between the predicted properties for each structure and the ideal 
achievement point*’ (Fig. 1b). Compounds were filtered for novelty, 
Lipinski’s rule of five compliance** and synthetic accessibility’. The 
top 10,000 prioritized structures were selected for the next iterative 
cycle along with 500 random structures from the remaining popu- 
lation. The process was iterated until either a structure close to the 
objectives was discovered or no further improvements were achieved. 

Initially, we evolved a series of isoindoles and prioritized them 
using our achievement objectives as criteria (Fig. 1c, Supplementary 
Table 4 and Supplementary Fig. 5). Eight analogues were then syn- 
thesized and tested (Fig. 2, Supplementary Figs 2 and 6 and 
Supplementary Table 5), with all showing substantial D2 receptor 
affinities (K; values = 156-1,700 nM, Supplementary Table 3). 

The second highest ranking compound (compound 3)—chosen 
from the final population of evolved structures —exhibited the high- 
est D2 receptor affinity (K; = 156 nM). Thus, we successfully evolved 
the negligible D2 receptor binding activity of donepezil into a series of 
ligands with higher D2 affinities (Fig. 2). Functionally, compound 3 
was a dual D2 inverse agonist and D4 agonist (Supplementary Fig. 3). 
CNS penetration studies showed that compound 3 penetrates the 
brain as predicted, with an in vivo brain-blood ratio (BBR) of 0.5. 

Although the evolved compounds were selected for the D2 recep- 
tor objective, other predicted activities were not selected against. 
Accordingly, each of the generated compounds had a predicted poly- 
pharmacology profile. In general, this set of isoindole analogues was 
predicted to exhibit promiscuous profiles, with variable activities 
predicted for multiple serotonergic, adrenergic and dopaminergic 
receptor subtypes (Supplementary Tables 1 and 5). These predicted 
promiscuous profiles were subsequently confirmed (Fig. 2 and 
Supplementary Tables 2 and 3), and the predicted multi-target pro- 
files displayed excellent agreement with experimentally determined 
profiles, thereby implying that the approach can be applied to the de 
novo design of multi-target agents. 


Reducing anti-target activity 


The isoindoles exhibited moderately potent affinities for the o- 
adrenoceptors (K, values =0.9-3,577nM, mean K,=277nM, 
Supplementary Table 3 and Supplementary Fig. 2). Because - 
adrenoceptor antagonists can induce low blood pressure as a side 
effect they are considered ‘anti-targets’ to be avoided in drug design”. 
A common drug design optimization problem is to reduce such anti- 
target activity while maintaining desired on-target activities. We 
accordingly evolved the eight newly synthesized isoindoles towards 
a polypharmacological profile (5-HT,, serotonin receptor and D2-, 
D3- and D4-dopamine receptors) with selectivity over the three o- 
adrenoreceptors anti-targets (04, %1p and 0%;p) while maintaining 
CNS penetration. 

To compare evolutionary strategies, the isoindoles were optimized 
towards polypharmacology objectives with and without highly pre- 
dicted « activities filtered-out at each generation (Fig. 3a and Sup- 
plementary Fig. 7 and Supplementary Tables 6 and 7). In both 
optimizations benzolactams were ranked the highest (compounds 
1la and 10a respectively) (Fig. 3b and Supplementary Tables 6 and 
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7). Six benzolactam analogues were then synthesized on the basis of 
both sets of objectives (Supplementary Table 8 and Supplementary 
Fig. 5). Analogues of the benzo-6-lactam (3,4,-dihydroisoquinolin- 
1(2H)-one) (compounds 9a, 10a and lla) and benzo-g-lactam 
(2,3,4,5-tetrahydro-1-H-benzo[c]azepin-1-one) (compounds 9b, 10b 
and 11b) were synthesized for comparison because both ring systems 
were highly prioritized. 

Both the predicted and observed receptor activity profiles for 
the synthesized benzolactams are shown in Fig. 3b (see also 
Supplementary Tables 1 and 3 and Supplementary Fig. 2). The 2- 
pyridine-piperazine analogues (compounds 11a and 11b) have the 
lowest «, predictions and, indeed, exhibited the lowest «, affinities 
(mean K; = 1,131nM). In agreement with the models, compounds 
1la and 11b also have the lowest affinity for all D2-like dopamine 
receptors of the benzolactams tested. The dichloro-phenylpiperazine 
analogues (compounds 9a and 9b) exhibited slightly higher « and D2 
binding predictions, which were also confirmed experimentally. By 
contrast, the 2-methoxy phenylpiperazine analogues (compounds 
10a and 10b) exhibited potent affinities against the polypharmaco- 
logy profile of 5-HT,,4, D2, D3 and D4 receptors, but also had the 
highest o%; predictions and were confirmed as the most potent against 
the «, receptors (mean K; = 45.3 nM) (Fig. 3b). Receptor profiling of 
the benzolactam series revealed that compared to the isoindoles, from 
which the compounds were evolved, the benzolactams achieved the 
objective of an increased polypharmacology profile for 5-HT,4, D2, 
D3 and D4 receptors compared with the «,-adrenoceptors (Fig. 3b). 
The benzolactams are 11-fold more selective with respect to D2 recep- 
tors and 25-fold more selective with respect to the polypharmacology 
profile than the isoindoles. Notably, the benzolactam compound 9a 
penetrated the brain, (BBR = 5.9), as predicted. 

As the benzolactam series was not present in the ChREMBL database 
(release 1) used to build the Bayesian models, they constituted a novel 
chemical series for the system to discover. Notably, benzolactam 
derivatives have recently been independently synthesized and tested 
as potent D2 and D3 ligands*’; however, the broader receptor profiles 
of the compounds were not evaluated. This observation provides 
additional verification that the algorithm is capable of generating 


and prioritizing novel chemical structures equivalent to those devised 
by medicinal chemists. 


Potency optimization 

We next explored selectivity in the context of our multi-target objec- 
tives and asked whether we could optimize potent, selective, CNS- 
penetrant D4 receptor ligands starting from the chemical structure of 
donepezil. We executed the optimization in two stages; first we opti- 
mized for D4 potency and brain penetrability (Fig. 4a), and then for 
D4 selectivity (Fig. 4b). A series of 2-methylindoline derivatives with 
predicted high D4 activity was evolved from donepezil after six gen- 
erations (Fig. 4a and Supplementary Fig. 9). Notably, compounds 
belonging to a 2,3-dihydro-indol-1-yl chemotype were dominant in 
the prioritized set (Supplementary Table 9). Compounds 12 and 13, 
which both belong to the 2,3-dihydro-indol-1-yl class, were then 
selected for testing (Supplementary Figs 5 and 8). The highest ranking 
compound, 12, was inactive, whereas compound 13, the third-highest- 
ranking design out of the final population, was the most potent D4 
ligand among all tested compounds (D4 Kj; = 8.9nM). Through the 
process of optimization, compound 13 represents a 69-fold increase in 
affinity over donepezil. In contrast to the isoindole and benzolactams 
analogues, compound 13 is predicted to be a highly selective D4 ligand 
with 95-fold selectivity over 5-HT 3 and weak affinities greater than 
1 uM for only five other receptors in our panel of G-protein-coupled 
receptors (GPCRs) (Fig. 2). Importantly, and as predicted, compound 
13 is highly CNS penetrant (BBR = 7.5). 

To verify that the predicted properties of selectivity, potency and 
CNS penetration resulted in D4 receptor activity in vivo, we evaluated 
the effects of compound 13 treatment on behaviour in wild-type and 
D4 receptor knockout (D4R-KO) mice (Fig. 4c-f), as well as in pro- 
protein convertase 7 (PC7 encoded by Pcsk7) knockout mice (Sup- 
plementary Fig. 10a-d) that display a similar D4R-KO phenotype. 
Although open field locomotor activity declined in vehicle-treated 
D4R wild-type (DR4-WT) animals (Fig. 4c), it remained high and 
showed little habituation in D4R-KO mice. Although the treatment 
with 0.7mgkg ‘ of compound 13 was without significant effect in 
either genotype, the 1 mgkg * dose reduced locomotion at 0-20 min 
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Figure 3 | Reducing og, anti-target activity by evolutionary design. 

a, Summary of the evolution of the prioritized benzolactam analogues 
(compounds 11a and 11b) from a parent isoindole analogue (compound 

5). The full evolutionary pathway from compound 5 to 11b is shown in 
Supplementary Fig. 7. b, Comparison of polypharmacology profiles for the 
Bayesian model score and experimental binding affinities (K,) of compound 
5 and benzolactam analogues (compounds 9a-11b) for the seven target 
objectives (1,4, %1p and %p compared to 5-HT},, D2, D3 and D4 receptors). 
On average, the selectivity ratios for the D2, D3, D4 and 5-HT}, receptors for 
the synthesized benzolactams over the o; receptors are 2.8-, 59-, 58- and 312- 
fold, respectively. In comparison, the average selectivity ratios for D2, D3, D4 
and 5-HT}, receptors for the isoindole analogues over the %, receptors are 
0.27-, 2.1-, 1- and 14-fold, respectively. With respect to the algorithm 
prioritization for the benzolactam analogues (Supplementary Table 8) against 
the multi-target objectives, the order of prioritization matches the 
experimentally determined order for the analogues (dichloro phenylpiperazine 
> 2-methoxy phenylpiperazine > 2-pyridine piperazine). 


in D4R-WT animals. The same dose had no effect in D4R-KO mice. In 
the centre zone, vehicle-treated D4R-KO mice spent more time in this 
area at 21-60 min than D4R-WT controls (Fig. 4d). Centre time in 
D4R-WT animals was enhanced at 0-20 min with both doses of com- 
pound 13; in D4R-KO mice it was attenuated at 41-60 min with the 
1mgkg ‘ dose. In the hole-board test, head-poking was increased in 
vehicle-treated D4R-KO animals compared to D4R-WT controls 
(Fig. 4e). In D4R-WT mice compound 13 augmented head-pokes in 
a dose-dependent fashion; only the 1 mgkg * dose attenuated head- 
poking in mutants. In the zero maze, open-area time was increased in 
vehicle-treated D4R-KO mice relative to D4R-WT controls (Fig. 4f). 
Treatment of lmgkg ' of compound 13 selectively increased D4R- 
WT open-area times to similar levels seen with the D4R-KO mice. 
With regards to PC7 mice, PC7-WT and D4R-WT responses were 
similar (Supplementary Fig. 8a-d). Although behaviours in PC7- 
KO vehicle controls essentially phenocopied those in vehicle-treated 
D4R-KO mice, compound 13 normalized PC7-KO responses to those 
of PC7-WT controls. By comparison, D4R-KO mice were largely 
unresponsive to compound 13—demonstrating high D4R selectivity. 
Nonetheless, compound 13 did not seem to be absolutely selective 
because in D4R-KO animals the 1 mg kg ' dose attenuated head-poking 
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in the hole-board test, implying some possible off-target actions at 
increased doses. 


Automated invention 


Using compound 13, we further expanded our objective to evolve 
ligands that were highly selective for D4 dopamine, CNS penetrant 
and a new chemotype. The evolution of compound 13 against these 
objectives resulted in the design of new morpholino compounds 
(Fig. 4b and Supplementary Fig. 11 and Supplementary Tables 10 
and 11). Compounds with the new isoindol-1-yl-ethyl-morpholino 
backbone were prominent in the prioritized final generation population 
of 10,000 structures (ranked fifth, sixth and ninth of the top 10 com- 
pounds in Supplementary Table 10), whereas most known D4 ligands 
are 1,4-disubstituted aromatic piperidines and piperazines (1,4-DAPs). 
However 1,4-DAPs are rather promiscuous substructures common in 
ligands for many biogenic amine GPCRs’. Therefore, the isoindol- 
1-yl-ethyl-morpholino analogues represent a new D4 chemotype. 

A library of 24 morpholino analogues (compounds 14-29) was 
then synthesized and profiled against our GPCR panel (Supplemen- 
tary Table 11 and Supplementary Fig. 5). Individual R and S morpho- 
lino enantiomers were synthesized and assayed separately (chirality 
designated by prefix). To reduce complexity further, direct analogues 
with and without the carbonyl oxygen were synthesized (for example, 
compounds 20 and 21), as this atom was not predicted to be essential 
to the overall D4 selectivity profile (Fig. 2). 

The assays confirmed the predictions that the new morpholino 
compounds are generally highly selective for the D4 receptor com- 
pared with the other tested receptors (Fig. 2). Seventeen of the com- 
pounds had affinities for the D4 dopamine receptor, ranging from 
K, = 90 nM (compound (S)-27) to K; = 5,526 nM (compound (S)-18) 
with eight exhibiting affinities less than 1 1M for D4 (compounds 
(R)-15, (S)-19, (S)-21, (R)-22, (S)-23, (S)-26, (S)-27 and (S)-28). 
Compounds containing the ethanone linker-group were generally less 
active than those with the ethyl linker. For compounds with the ethyl 
linker, the S enantiomer was more potent than the R enantiomer. 
Functional assays of an exemplar compound (R)-22 indicated inverse 
agonism at the D4 receptor (Supplementary Fig. 3). 

In agreement with the design objectives, the morpholinos displayed 
exquisite selectivity for the D4 dopamine receptor. Excluding the 
dopamine receptors, low positive Bayesian scores were observed for 
eight of the compounds against 5-HT,, serotonin receptors, 16 of 
the compounds against the 5-HT2,4/p/c serotonin receptors, and 
almost all had very low scores for 5-HT; serotonin receptors. The 
off-target trends were confirmed when the compounds were profiled 
(Fig. 2). The morpholino compounds, on average, bound to 3.4 targets 
(including D4 at K;< 10M) compared to 15.8 targets for the iso- 
indole and benzolactam compounds. Seven of the active compounds 
had off-target activities for only 1 of the 20 receptors tested. Compound 
(S)-26 is the most selective compound with no measured affinity for any 
other screened receptor. Four compounds possessed both relatively 
high affinity (D4 K, <1 |1M) and two or fewer off-target activities out 
of the 20 GPCRs profiled ((S)-21, (S)-26, (S)-27 and (S)-28). 

The morpholino series thus represents a new class of highly selective, 
brain-penetrant, D4 dopamine receptor ligands. Compounds (S)-27 
(D4 K,=90nM; D1 K,=5852nM; BBR=2.0) and (S)-21 (D4 
K, = 182 nM, 5-HT2, K; = 3,545 nM) qualified as lead compounds that 
fulfilled all of our design objectives of novelty, high affinity for the D4 
dopamine receptor, exquisite selectivity and CNS penetration. Clearly 
the automated design of a new class of ligands with a desired multi- 
target profile demonstrates that the method is able to generate novel, 
drug-like lead compounds directly by automated design. 


Drug design from knowledge 

We focused on the polypharmacology of bioaminergic GPCRs as a 
convenient test case, owing to the importance of multi-target profiles 
at these receptors for a variety of neuropsychiatric indications*. In 
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Figure 4 | Evolution of D4 dopamine ligands from donepezil. a, Summary 
of the evolution of donepezil (compound 1) (D4 dopamine Bayesian 

score = 26, D4 K, = 614nM) into D4 dopamine inverse agonist compound 
13 (D4 dopamine Bayesian score = 112, D4 k; = 8.9nM). The Tanimoto 
similarity between donepezil and compound 13 is only 0.26. The full 
evolutionary pathway from donepezil to compound 13 is shown in 
Supplementary Fig. 9. b, Summary of the evolution of selective novel D4 
dopamine ligands. Compound 13 is further evolved by selection for novelty and 
D4 selectivity into the morpholino analogues 18, 20 (R and S), 21 

(R and S) and 27 (Rand S). The full evolutionary pathway from compound 
13 to 27 is shown in Supplementary Fig. 11 c-f, Behavioural analysis of a novel 
D4 dopamine ligand in D4 receptor knockout (D4R KO) mice. D4R-WT, 
D4R-wild type. c, Distance travelled in the open field over 60 min. Mice were 


principle, the approach is applicable to all drug-target classes, limited 
only by the requirement for sufficient structure—activity data to create 
useful models”"’*. To extend polypharmacology profiling and hence 
de novo design it will be necessary to develop inference methods to 
build predictive bioactivity models that integrate all available struc- 
ture-activity relationships, protein structure and protein sequence 
information together and combine data from diverse scoring func- 
tions into predictive frameworks”. 

De novo, automated compound design against multi-target profiles 
provides a powerful new approach for discovering new ligands and 
drug leads. The method is particularly useful as a new source of leads 
for polypharmacology profiles. 


METHODS SUMMARY 


Adaptive optimization. A library of historical chemical structure transforma- 
tions was derived by mining the ChEMBL database”. Each transformation was 
systematically applied to every parent compound in each generation. Bayesian 
activity models’*° were calculated for more than 784 proteins based on the ECFP6 
fingerprint representation” of chemical structures of structure-activity data in 
ChEMBL. Each compound in every generation was scored against the panel of 
Bayesian models. ADME properties were calculated using StarDrop (Optibrium). 


given intraperitoneal injection of vehicle or 0.7 or 1mgkg * of compound 
13 and tested immediately over 60 min. d, Time spent in the centre zone in the 
open field. e, The numbers of head-pokes in the hole-board test. Animals were 
injected with vehicle or 0.7 or 1mgkg * of compound 13 and tested 30 min 
later over 10 min. f, Percentage of time in the open areas in the zero maze in 
D4R mice. Animals were administered vehicle or compound 13 and tested 
30 min later for 5 min. n = 8-14 mice per treatment/condition. 

*P < 0.05, wild-type compared with D4R-KO mice or compared to the 
D4R-KO mice at the same time-point; P < 0.05, comparisons within genotype 
to the vehicle; {P < 0.05, comparisons between 0.7 and 1 mg kg! compound 
13; 8P < 0.05, compared to the 0-20 min time-point. All error bars are 

mean + s.e.m. 


Synthetic accessibility was assessed using a previously published method* 
based on the statistical representation of fragments in ChEMBL and molecu- 
lar complexity. The algorithm was programmed in Pipeline Pilot (version 7.5, 
Accelrys). 

Chemical synthesis. Details of the purchase, synthesis and purification of all 
compounds are provided in the Supplementary Information. 

Experimental testing. The predicted receptor profiles of all the compounds 
were assessed using standard radioligand binding and functional assay methods 
at the National Institute of Mental Health Psychoactive Drug Screening Program. 
The hERG assay was performed as previously described””. Metabolic stability 
was assessed as previously described** and brain penetration was assessed by 
determining the ratio of tested compound between brain and blood at a set 
time point following intraperitoneal or intravenous administration. For beha- 
vioural studies, compound 13 was dissolved in a 0.1% N,N-dimethylacetamide 
(DMA) with 15% 2-hydroxypropyl-f-cyclodextrin (Sigma-Aldrich) solution and 
injected intraperitoneally immediately before placement into the open field 
or 30min before the zero-maze or hole-board tests as described”? using 
AccuScan (AccuScan Instruments), Noldus Observer XT 9.0 (Noldus Techno- 
logies), and Clever Sys TopScan software (Clever Sys), respectively. All beha- 
vioural data are presented as means + s.e.m and were analysed with SPSS 20 
(IBM). Details of the in vivo experiments are described in the Supplementary 
Information. 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Data sets. All machine learning and data mining of the medicinal chemistry 
structure-activity data were conducted on the ChEMBL database (release 1) 
and a pre-release (StARLite version 31)?*. The ChEMBL database contains 
(release 1) more than 440,000 compounds abstracted in J. Med. Chem. and 
Bioorg. Med. Chem. Lett. from 1980 to May 2008. The ChEMBL database is 
available for download from the EBI. 
Chemical transformations. A database of chemical transformations was derived 
from systematically comparing sets of analogue compounds in ChEMBL™. Sets of 
analogues with defined structure-activity relationships were identified in 
ChEMBL* usually from individual journal articles. The transformations database 
was seeded with a set of common chemical transformations derived from medi- 
cinal chemistry knowledge. The transformations database was then expanded by 
systematically applying all existing transformations to each of the structures 
associated with each of the journal articles in ChEMBL. The resulting set of 
transformed compounds was compared to the published analogues. Analogues 
that were not present in the transformed set highlighted potential transforma- 
tions that were missing from the transformation database, and were subsequently 
added to the database. This iterative mining method attempts to regenerate all the 
reported structures of every medicinal chemistry publication reported in 
ChEMBL. The chemical transformations were encoded in RXN format. The 
procedure was implemented in Pipeline Pilot. The current database contains 
more than 700 unique structural transformations. 
Bayesian models. Predictive polypharmacology profiling was undertaken using 
Bayesian activity models, based on our previously published approach’. The 
Bayesian method for polypharmacology profiling was chosen as it provided both 
good performance for noisy data sets and a high speed of calculation’. High 
confidence models were built using ChEMBL (release 1). Activity data were 
filtered to keep only activity end-point points that had half-maximum inhibitory 
concentration (ICs 9), half-maximum effective concentration (ECs9) or K; values 
and where the ChEMBL confidence score was at least seven (protein assignment 
was direct or a homologue). A compound was considered active when the mean 
activity value was below 10M. All inactive compounds were assigned to the 
target ‘none’. Following this procedure, 133,061 compounds remained with 
215,967 activity end-points, which were used for model building. Multiple cate- 
gory Laplacian-modified naive Bayesian models were built with ECFP6 repre- 
sentations” for 784 targets. For each model the data were split in two for the 
validation step: compounds were clustered and assigned a cluster number. 
Clusters with an odd number were assigned to the test set, and the clusters with 
an even number were assigned to the training set. Models were built with the 
training set, and the test set was scored. The training set was scored using its own 
model as comparison. Finally, a model was built with all data and scored against 
itself—the training set and whole set should provide similar validation statistics. 
Statistics on the performance of the models are described in Supplementary Table 
12. The results for the model containing all 785 targets were very similar to the 
models for the receptor subsets. Two analyses were used to assess the perform- 
ance of the different models. The first analysis provides an overall score and does 
not need to specify a cut-off for distinguishing active from inactive compounds. 
The area under the receiver operating characteristic (ROC) curve provides an 
indication of the ability of the model to prioritize active compounds over inactive 
compounds. The ROC curve is the plot of the true positive versus the false positive 
rate. However, it does not provide information on early enrichment, which was 
important in studies such as the present one in which only the top-ranking 
compounds were considered. Therefore the Boltzmann-enhanced discrimination 
of ROC (BEDROC)* was used, which solves the early enrichment issue by adding 
a weight to compounds recognized early. BEDROC was derived from the robust 
initial enhancement, and the sum of log of ranks test**, which provided a statis- 
tical test to assess which method performs better than random. The percentage of 
active compound retrieved in the top 5% is also calculated (recall = 5%). The 
second analysis required a cut-off to make the distinction between active and 
inactive, as they varied with the rank of the compounds. For each model, the 
specificity (true negative rate), sensitivity (true positive rate), false positive rate, 
false negative rate, precision, F-measure and Matthews correlation coefficient 
(MCC) were calculated at different cut-off values. The cut-off providing the best 
MCC score was used, as it was shown to provide better performance” 
(Supplementary Fig. 12). The quality of the models was assessed using an internal 
leave-one-out validation: one compound was part of the test set and was scored 
using the remaining data as the training set. Then the area under the ROC curve 
was calculated (Supplementary Fig. 13). A cut-off score was calculated to min- 
imize the sum of the percentage misclassified for category members and for 
category non-members and used to classify compounds in the contingency table. 
Anall data model for dopamine receptors only was built using data from a pre- 
release of ChEMBL (StARLite version 31), with similar numbers of compounds 
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and end-points. The model was built without considering the confidence level of 
target assignment to gather as much data as possible. This model was used for 
initial calculation on the evolution of the isoindole series and the 2,3-dihydro- 
indol-1-yl series. The quality of the models was assessed using the same proce- 
dures as described earlier. The results from the all data models and the high 
confidence models were very similar (for example, D2 models R? = 0.998, D4 
models R? = 0.984). 

Profile prediction probabilities. The cut-off for a good prediction came from the 
validation steps for the model. From the test set, the cut-off value providing the 
best MCC value was used. For the 5-HT}» receptor, a cut-off of zero was selected. 
For each ligand-target association, the probability of success was 0.5 (active or 
inactive where activity was defined as K; < 10 1M). To test whether the profile 
predictions were better than random, an exact binomial test was performed using 
R (version 2.8.1) (http://www.r-project.org), and the cumulative binomial pro- 
bability was calculated. 

Adaptive optimization. The adaptive optimization procedure involves defining 
a parent population of compounds (Pp), of at least one molecule, and a set of 
n ideal achievement objectives {xo's .-->Xo"}, where nis at least 1. The parent 
population of compounds for each generation (G) is defined as Ppig = »), in which 
n= 0 for the initial population. All of the members of the parent population (Pp) 
are subjected to all possible transformations, to maximize the pool of molecules in 
the transformed population (P;). Each member of the transformed population is 
scored using the calculated predictions and molecular properties and ranked 
relative to the achievement objectives {xo', ...X0"}. Pareto ranking is acommon 
method for prioritizing multiple criteria’. The Pareto frontier maps a surface 
in which all solutions are considered equivalent (non-dominant)—where an 
increase in one objective leads to a decrease in at least one or more other objec- 
tives. However, finding a Pareto optimal solution becomes difficult when many 
objectives are considered”*, hence a vector scalarization procedure is used”. 
Multi-objective prioritization is performed using vector scalarization, by describ- 
ing the calculated parameters for a compound (A) as a multi-dimensional coordi- 
nate. The results are ranked by the magnitude of the vector ||a|| between the 
multi-dimensional coordinates of predicted values of the chemical structure of a 
compound (A) and the defined ideal achievement objective point, (O), with the 
shortest vector length closest to the ideal in multi-dimensional space, being 
ranked the highest: 


n 


llal=,| 9. @&o? —x4?? 
p=l 
in which the ideal achievement objective point has the coordinates Goh esos 
and calculated values of the compound A form the coordinates (x4), 009%”). 


For subsequent generations, prioritized individuals with calculated properties 
and parameters that satisfy defined thresholds are assigned to an elite population 
(Pz), and those that fail are assigned to a non-elite population (Py), of which n 
random members form a random population (Px). For all the calculations per- 
formed, P; has a maximum size of 10,000 and Pg a maximum size of 500. The 
new parent population (Ppig = n+ 1)) is created by merging Py and Pp (that is, 
Pog =n+1) = PeG=n) + Pag =n))- This new population is subjected to another 
transformation process. The new parent population Pp from each generation is 
also added to a combined pool of all observed parents (Ppau). 

The population of transformed compounds from the last iteration and the 
pooled parent population are combined and all duplicates and compounds failing 
structure valency rules are removed to produce a merged population (Py) of 
unique members (Py = Ppa + Py). Properties and parameters (for example, 
Bayesian activity models values, physicochemical properties and predicted 
ADME properties) are calculated for each individual in Py. Each member popu- 
lation Py is evaluated against the achievement objectives {xo',...X0"}, as 
described above. The entire process is repeated until a stop condition is satisfied. 

Novelty is assessed by comparing the generated compound with compounds in 
ChEMBL, either as an exact match or by comparison of the Murcko framework", 
depending on whether the objectives are defined in terms of new compounds or 
new chemotypes. Novelty is filtered depending on the goals. ADME properties 
and CNS penetration are calculated using previously publishing Gaussian process 
models*”** as implemented in StarDrop (Optibrium). A synthetic accessibility 
score, representing historical synthetic knowledge, is calculated using a prev- 
iously published algorithm”. The synthetic accessibility score combines the 
observation of fragments in ChEMBL and a complexity penalty. A limitation of 
the ECFP6-Bayesian prediction method is that fingerprints cannot distinguish 
stereochemistry if the stereochemistry is not encoded in the training data. In the 
ChEMBL database only 43% of the chiral compounds with chiral centres have 
their stereochemistry fully defined, thus it is not possible to distinguish between R 
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and S enantiomers. To reduce the complexity of synthesis, compounds with two 
or more chiral centres were filtered from the final population. 

Receptor profiling. The detailed experimental protocols for the radioligand and 
functional receptor assays are available on the NIMH PDSP website (http:// 
pdsp.med.unc.edu/UNC-CH%20Protocol%20Book.pdf). 

hERG assay. Activity of the potassium channel hERG was assayed by the patch 
clamp method on a PatchXpress platform and by FluxOR TI" assays. Assays were 
performed as previously described”. 

Metabolic stability assay. Metabolic stability was assessed, generating the in vitro 
intrinsic clearance (Cl;) following incubation of test compound with mouse hepa- 
tic microsomes. The assay was performed as previously described”. 

Brain penetration measurement. Mice were housed under standard conditions: 
12h:12h light-dark cycle and food and water available ad libitum throughout the 
study. The Drug Discovery Unit at the University of Dundee is dedicated to the 
humane care, maintenance and use of research animals and maintains compliance 
with UK Home Office regulations. All experiments were approved by the local 
ethical review committee. The ratio of test compound between brain and blood was 
assessed following intravenous administration to the female NMRI mouse. 
Behavioural testing. Adult male and female C57BL/6J, D4R-KO mice (Jackson 
Laboratory), and PC7-KO mice were used. PC7 mice were backcrossed with 
C57BL/6] mice for more than ten generations. Animals were maintained in a 
humidity- and temperature-controlled room under a 14h:10h light-dark cycle 
(lights on at 8:00). All studies were conducted during the light cycle, between 
10:00 and 16:00 in the following order: zero maze, open field and hole-board. All 
tests were separated by at least by 7 days. Animals were assigned to vehicle (0.1% 
DMA with 15% 2-hydroxypropyl-f-cyclodextrin (Sigma-Aldrich)) or compound 
13-treated groups, and were maintained in these groups throughout the experi- 
ments. Water and laboratory chow were supplied ad libitum. All experiments 
were conducted with an approved protocol from the Duke University Institutional 
Animal Care and Use Committee and according to the NIH Guide for the Care and 
Use of Laboratory Animals. To overcome the high intrinsic clearance in mouse 
hepatic microsomes of compound 13 (Cl, = 13.8 ml min ! g'), mice were 


injected intraperitoneally with vehicle, or 0.7 or 1mgkg ' of compound 13 and 
placed immediately into the open field for 60 min as described”. Activity was 
monitored as distance travelled and time spent in the centre zone. In the hole- 
board and zero-maze tests*”*° mice were injected intraperitoneally with vehicle or 
compound 13 and tested 30 min later. Hole-board test responses were video taped 
for 10 min using high-resolution low-light cameras (Panasonic) and were scored 
using the TopScan program (Clever Sys) for the rate of head-pokes into the 16 
holes. Zero maze behaviours were video taped over 5 min and were scored by 
trained observers blinded to the genotype, sex and treatment-condition of the 
animals using the Observer XT10 program (Noldus Information Technology) 
for the percentage of time in the open areas. All behavioural data presented as 
mean + s.e.m. and were analysed by analysis of variance (ANOVA) and repeated 
measures ANOVA followed by Bonferroni corrected pair-wise comparisons (IBM 
SPSS 20). P< 0.05 was considered significant. 
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Formation of the widest binary stars from dynamical 
unfolding of triple systems 


Bo Reipurth! & Seppo Mikkola” 


The formation of very wide binary systems’ *, such as the a Centauri 
system with Proxima (also known as @ Centauri C) separated from 
o Centauri (which itself is a close binary A/B) by 15,000 astronomical 
units* (1 AU is the distance from Earth to the Sun), challenges current 
theories of star formation, because their separation can exceed the 
typical size of a collapsing cloud core. Various hypotheses have been 
proposed to overcome this problem, including the suggestion that 
ultrawide binaries result from the dissolution of a star cluster—when a 
cluster star gravitationally captures another, distant, cluster star*”’. 
Recent observations have shown that very wide binaries are frequently 
members of triple systems*° and that close binaries often have a dis- 
tant third companion’’”’. Here we report N-body simulations of the 
dynamical evolution of newborn triple systems still embedded in their 
nascent cloud cores that match observations of very wide systems'***. 
We find that although the triple systems are born very compact—and. 
therefore initially are more protected against disruption by passing 
stars'®’’—they can develop extreme hierarchical architectures on 
timescales of millions of years as one component is dynamically 
scattered into a very distant orbit. The energy of ejection comes 
from shrinking the orbits of the other two stars, often making them 
look from a distance like a single star. Such loosely bound triple 
systems will therefore appear to be very wide binaries. 

Evidence is building that stars often, and possibly always, are formed 
in small multiple systems'*””. Dynamical interactions between mem- 
bers of such systems lead to close triple encounters in which energy 
and momentum is exchanged, typically causing the disintegration 
of the triple system, with the escape of a single component (most 
frequently the lowest-mass member) and the formation of a stable 
binary”. A bound triple with a hierarchical architecture may also 
result, but only if it forms in the presence of a gravitational potential 
can such a triple system achieve long-term stability”. However, this is 


a Stable bound triples 


frequently fulfilled for newborn triple systems, since break-up typically 
occurs in the protostellar phase, when the newborn stars are still deeply 
embedded in their nascent cloud cores”. 

We have used an advanced N-body code to run 180,218 simulations 
of a newborn triple system placed in a gravitational potential” (for 
technical details see Supplementary Information). Figure la shows 
results for the 13,727 stable hierarchical systems that are formed in 
the 180,218 simulations. The blue dots mark the semimajor axes of the 
inner binaries, and the red dots indicate the semimajor axes of the 
outer components relative to the centre of mass of the inner binary. 
The distant components have semimajor axes that span from hun- 
dreds to several thousands of astronomical units, with a small but not 
negligible number of cases reaching several tens of thousands of astro- 
nomical units and beyond. 

We run the simulations for 100 million years, and classify the out- 
come at one million years (1 Myr), 10 Myr and 100 Myr into stable 
triples, unstable triples and disrupted systems. If the outer orbit is 
hyperbolic, then the system is disrupted. If not, then the system is 
(at least temporarily) bound, and a stability criterion is applied’. If 
the system passes this test, it is classified as stable. If not, it is still bound 
but internally unstable, and will sooner or later disrupt. Figure 1b 
shows the semimajor axis distribution for 56,957 bound but unstable 
triple systems. 

The number of systems in each category is merely an estimate, 
because a rigorous theory of stability is not mathematically possible, 
given that the three-body problem is non-integrable. Hence we see that 
the number of stable triples at 1 Myr, 10 Myr and 100 Myr is not 
completely constant, but declines by a few per cent over time. The 
number of unstable triples, on the other hand, declines dramatically 
with age, by a factor of three or more from 1 Myr to 100 Myr (Fig. 2a). 
At 1 Myr, 39% of systems are bound (stable and unstable), at 10 Myr 
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Figure 1 | Semimajor axes of stable and unstable bound triple systems. 

a, The semimajor axes of the outer and inner pairs in stable bound triple 
systems. Filled blue circles are the inner binaries in stable hierarchical triple 
systems; filled red circles are the more distant singles in stable hierarchical triple 
systems. b, The semimajor axes of the outer and inner pairs in unstable bound 
triple systems. Turquoise circles are binaries in unstable bound triple systems; 
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green circles are singles in unstable bound triple systems. In both panels, the 
semimajor axes refer to orbital parameters for systems that still remain bound 
at 1 Myr; at later times many of the unstable systems will have disrupted. For the 
widest systems, the distant bodies have not yet reached their extreme apastron 
distances. 
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Figure 2 | Statistical properties of stable, unstable, and disrupted triple 
systems. The colour scheme is the same as in Fig. 1. a, Histogram with number 
of stable hierarchical, unstable hierarchical and disrupted triple systems at 

1 Myr, 10 Myr and 100 Myr. The stable systems are essentially constant, 
whereas many of the unstable systems disrupt. b, The distribution of semimajor 
axes for both inner and outer binaries that are bound at 1 Myr. The grey dashed 
line shows the sum of all (inner and outer) binaries. c, The distribution of 
eccentricities e for bound inner and outer binaries at 1 Myr. It is evident that 
highly eccentric systems are common, and the number of triple systems is a 
growing function of eccentricity for all but the stable outer systems, which peak 
around e ~ 0.7. Systems with very high eccentricity tend to have smaller 
periastron distances a(1-e), leading to the possibility of perturbations that, 
after one or more close periastron passages, eventually lead to break-up. Hence, 
the decline seen at high eccentricities for bound stable systems (red) is 
compensated by an increase among the unstable outer systems (green). The 
eccentricity distribution for the inner binaries will evolve significantly if 
circumstellar material is still present at birth or owing to Kozai resonance 
(when the inclination and eccentricity of a perturbed orbit oscillate 
synchronously). d, The distribution of semimajor axes of the outer components 
in triple systems show a very strong dependence on the eccentricity. On 
average, wide binaries (1,000 < a < 10,000 av) get increasingly wide as the 
eccentricity increases. For very wide binaries (a > 10,000 au, marked in red) 
this correlation becomes even more pronounced. Although very wide binaries 
can be found with modest eccentricities (e ~ 0.3-0.4), the majority have 
eccentricities exceeding 0.9. 
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Figure 3 | Stable and unstable hierarchical triple systems at 1 Myr and at 
100 Myr, and the maximum extent a(1 + e) ofa triple system as a function of 
time. a, The total mass of the close binary (M4 + Mg) is plotted against the 
mass of the distant third body Mc at an age of 1 Myr for all wide systems with 
outer semimajor axes exceeding 1,000 av. Systems that are classified as unstable 
are marked green, and systems that are stable over long timespans are marked 
red. The figure is divided into two areas, in one half most of the system mass 
resides in the binary, whereas in the other half the single dominates the system. 
A line indicates where systems with three identical bodies lie. b, As for a but for 
wide systems at an age of 100 Myr. Stable and unstable systems can be found all 
over the diagram, but with a strong preference for unstable systems to have a 
dominant binary, while stable unequal systems have a slight preference for a 
dominant single. At young ages hierarchical triple systems therefore frequently 
have dominant binaries. c, A system with outer period P of 2 Myr will for the 
first time reach apastron after 1 Myr. All systems in the grey-shaded area have 
reached apastron at least once within 1 Myr. During that time, no system has 
reached a separation of more than 50,000 Av. The dotted blue line shows how 
far the centre of mass of a triple system has moved in a given amount of time 
assuming a velocity of 1kms |. Values are shown for 1 Myr and 10 Myr. The 
widest systems, which take tens or hundreds of millions of years to unfold, will 
have moved away from the denser and more perilous environment in which 
they were born before being fully unfolded. 
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Figure 4 | Frequency of stable, 
unstable and disrupted triple 
systems as a function of projected 
separation. The left panels show the 
number of bound stable (red), bound 
unstable (green) and unbound (blue) 
triple systems as a function of 
projected separation for three 
different ages. The black dotted line 
indicates the sum of all three. This is a 
snapshot at 1 Myr, 10 Myr and 
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binaries projected onto the sky for all 
180,218 simulations. The right panels 
show the fraction of bound stable 
(red), bound unstable (green), and 
unbound (blue) triple systems relative 
to the total number of triple systems as 
a function of projected separation for 
three different ages. These diagrams 
allow a statistical determination of the 
state of an observed triple system. At 
1 Myr the bound systems dominate 
but only out to separations of 

0.1 parsec, after which the disrupted 


pairs strongly dominate. At these early 


Fraction 


ages, the unstable triples are more 
common than stable triples by factors 
of three or more. As time passes, the 
unstable triples break up and for 
systems with observed separations of 
less than 4,000 au the stable triples 
slightly outnumber the unstable ones 
at 10 Myr. At the same time, the 
disrupted systems (which will appear 
as common proper motion pairs) are 
moving apart and become dominant 
only at separations exceeding 4 parsec. 


Projected separation (au) 


this number has decreased to 18%, and at 100 Myr it is 12%. This 
number will continue to decrease until only the stable triple systems 
remain, which is about 7%. This last number compares well with the 8% 
of triple systems (the mean of all spectral types) observed in the field’®, 
suggesting that most star-forming events must start as triple systems. 

The reason for the difference in distribution of semimajor axes 
between stable and unstable triple systems (Fig. la, b) lies in their 
orbital parameters. Figure 2b shows the separation distribution func- 
tion of stable outer (red) and stable inner (blue) and unstable outer 
(green) and unstable inner (turquoise) systems at an age of 1 Myr. 
Figure 2c shows the distribution of eccentricities among the bound 
triple systems. The primary reason that some systems are stable and 
others unstable is that the stable systems are well separated even at 
periastron, when the three bodies have their closest approach. In con- 
trast, the unstable systems are much more likely to suffer perturbations 
at closest approach, ultimately leading to their disruption. This is 
reflected in the eccentricity of the systems; see Fig. 2 legend for details. 

There are three main parameters that control the stability of a triple 
system: the semimajor axis, the eccentricity e, and the ratio of periastron 
distance of the outer binary to the apastron distance of the inner binary 
(periastron and apastron are the points in the orbit when the stars are 


S 5 Finally, at 100 Myr, the stable triples 
ay aa dominate out to separations of about 
a @ 8,000 AU, but for larger separations the 

very widest binaries are mostly bound 
but unstable systems, while the 
disrupted triples play only a minor 
part at these more advanced ages. The 
two vertical lines mark separations of 

1,000 AU and 10,000 av. 


Projected separation (av) 


closest and most distant, respectively). If the outer periastron distance 
becomes smaller than roughly 5-10 times the inner apastron distance, 
then the system will eventually break up. Systems with close inner 
binaries thus have a larger chance of achieving stability. For the outer 
binary, the relation between eccentricity and semimajor axis is shown 
in Fig. 2d. Wide binaries (1,000-10,000 au) are found with all eccent- 
ricities except the very smallest, although there is a clear preference for 
larger eccentricities. For the very wide binaries (>10,000 Av), in con- 
trast, the eccentricities tend to be extreme (red dots in Fig. 2d). The 
reason they can survive is that their periastron distance a(1 - e) is also 
large, thanks to their semimajor axes a also being extremely large, 
although a few are stable even with moderate eccentricity, presumably 
because they have unusually small inner binaries. 

Wide stable and unstable triple systems differ in one important 
respect. Figure 3a,b shows how total binary mass relates to the mass 
of the distant third body for systems wider than 1,000 au at ages of 
1 Myr and 100 Myr. A large population exists at 1 Myr with members 
that are either all three approximately of the same mass, or with the 
distant third member being of very low mass. This population, how- 
ever, is largely unstable (green), and so has mostly disappeared 
after 100 Myr, except for the very low-mass systems. The reason that 
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systems with a dominant binary and a light single are more unstable 
than the opposite configuration is probably that massive binaries can 
more easily alter the orbit of the third body near periastron, eventually 
leading to disruption. Stable triple systems (red) are much more uni- 
formly distributed across Fig. 3, although with a preference for mem- 
bers of very-low-mass systems to be approximately of the same mass, 
and a slight preference for systems with dominant singles rather than 
dominant binaries. Such time-dependent properties of wide triple sys- 
tems may be a dynamic signature of the triple decay mechanism. 

Our simulations do not take into account that there may be fur- 
ther orbital evolution of the inner binary when the decay from the 
non-hierarchical to the hierarchical configuration occurs during 
the protostellar phase (as it does for more than 50% of simulations”). 
In that case, viscous evolution will cause further in-spiralling of the 
inner binary, leading to the formation of spectroscopic binaries. Gas- 
induced orbital decay can ultimately lead to the merger of the binary 
components in a non-negligible number of cases”. It follows that 
although wide binaries formed through triple decay initially consist 
of three stars, during the pre-main sequence phase they may evolve 
into a true wide binary containing only two stars. 

The results presented here refer to the birth population of binaries, 
at an age of 1 Myr. The orbital parameters of a triple system are estab- 
lished at the moment when a stable hierarchical triple is formed. But 
since the birth configuration of a triple system is compact, it will take 
half an orbital period of the outer component before the triple system 
reaches its first apastron passage and attains its maximum extent 
a(1 + e). We call this the initial unfolding time of the newborn triple 
system. Many wide systems have not unfolded fully at 1 Myr, and the 
most extreme wide systems will take tens to hundreds of million years 
to unfold, and are thus more protected against disruption by passing 
stars'®’” than if triple systems were born with such enormous separa- 
tions in their crowded natal environments (Fig. 3c). See Supplemen- 
tary Information for more details. 

Non-hierarchical systems that have broken up shortly after birth 
lead to a close binary and a detached single star that is moving away 
from the binary. Those with small velocity differences, or with motions 
mainly along the line of sight, will be observed to linger for a while in 
the vicinity of each other, mimicking a bound binary. It is of interest to 
determine what is the fraction of stable bound, unstable bound and 
unbound triple systems that exist at different projected separations at 
different ages. Figure 4 shows the number and fraction of systems in 
each category for 1 Myr, 10 Myr and 100 Myr. As a specific example, 
the recently discovered very wide (~ 12,000-40,000 av) triple systems 
found in 7-10-Myr-old associations*” have only about a 20% chance 
of being long-lived systems, a 20% chance of already being disrupted sys- 
tems, but a 60% chance that they are unstable but still bound systems. 

At 100 Myr, 8.5% of the 180,218 simulations have led to bound 
(stable and unstable) systems with semimajor axes a between 1,000 
and 10,000 Au, and 2.1% are bound with a > 10,000 av. That is, more 
than 10% of the birth population of triple systems end up as wide or 
very wide, in excellent agreement with observations”. The simulations 
also broadly reproduce the observed distribution of projected separa- 
tions'**; see Supplementary Information for details. The present 
N-body simulations have thus demonstrated that the widest binaries 
known can arise naturally as a consequence of three-body dynamics 
shortly after birth. The subsequent, lengthy unfolding of the widest 
systems offers increased protection against external disruption by 
other young stars, and allows such wide systems to be born in both 
stellar associations as well as the outskirts of dense clusters. 
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An unexpectedly low oscillator strength as the origin 
of the Fe xvil emission problem 


S. Bernitt', G. V. Brown’, J. K. Rudolph’, R. Steinbriigge’, A. Graf’, M. Leutenegger*®, S. W. Epp’, S. Eberle!, K. Kubiéek'+, 
V. Mickel}, M. C. Simon’, E. Triibert?, E. W. Magee’, C. Beilmann!, N. Hell*®, S. Schippers’, A. Miiller?, S. M. Kahn’, 
A. Surzhykov'®""", Z. Harman”, C. H. Keitel', J. Clementson?, F. S. Porter*, W. Schlotter’, J. J. Turner!’, J. Ullrich't, 


P. Beiersdorfer? & J. R. Crespo Lopez-Urrutia’ 


Highly charged iron (Fe’®*, here referred to as Fe xvi) produces 
some of the brightest X-ray emission lines from hot astrophysical 
objects’, including galaxy clusters and stellar coronae, and it domi- 
nates the emission of the Sun at wavelengths near 15 angstroms. 
The Fexvui spectrum is, however, poorly fitted by even the best 
astrophysical models. A particular problem has been that the 
intensity of the strongest Fe xvui line is generally weaker than pre- 
dicted”*. This has affected the interpretation of observations by the 
Chandra and XMM-Newton orbiting X-ray missions’, fuelling a 
continuing controversy over whether this discrepancy is caused by 
incomplete modelling of the plasma environment in these objects 
or by shortcomings in the treatment of the underlying atomic 
physics. Here we report the results of an experiment in which a 


target of iron ions was induced to fluoresce by subjecting it to 
femtosecond X-ray pulses from a free-electron laser*; our aim 
was to isolate a key aspect of the quantum mechanical description 
of the line emission. Surprisingly, we find a relative oscillator 
strength that is unexpectedly low, differing by 3.6¢ from the best 
quantum mechanical calculations. Our measurements suggest that 
the poor agreement is rooted in the quality of the underlying 
atomic wavefunctions rather than in insufficient modelling of 
collisional processes. 

Past laboratory measurements using magnetic fusion devices” have 
ruled out some of the astrophysical explanations for the weakness of 
the strongest Fe xvi line (labelled 3C when first observed in the Sun 
40 yr ago’; see Fig. 1), including absorption and scattering by a large 
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Figure 1 | X-ray spectrum of Capella, and measured as well as calculated 
values of the 3C/3D intensity ratio. Main panel, the spectrum of Capella (« 
Aurigae) observed by the Chandra X-ray Observatory” is dominated by lines 
from iron ions of different charge, as indicated by the blue labels. Here, the 
detected photon flux in arbitrary units is plotted versus the photon energy. 

Fe xvii indicates emission from Fe!°* (that is, iron from which 16 electrons were 
removed), Fe xvii refers to emission from Fe’, and so on. The energy range 
covered by the present measurement is shaded in grey, corresponding to 
wavelengths near 15 A. The Fe xvui lines 3C and 3D are highlighted in green and 
refer to the (2p”)12(3d)3/2 J = 1) > 2p° (J = 0) and (2p")3,a(3d)52 (J = 1) > 2p* 
(J = 0) transitions at 826 eV and 812 eV, respectively. The Fe xvi emission 


dominates astrophysical spectra for temperatures between about 2 MK and 

8 MK (ref. 1). Inset, illustration of some of the available calculated values'*'®*?** 
and various astrophysical and laboratory observations. Ranges of measured 
values from the Princeton Large Torus (PLT) tokamak”® and from the 
Livermore electron beam ion trap (EBIT)'®”° are shown together with 
astrophysical values from NGC 4635’, the Sun””** and Capella””’. Where 
available, 1¢ error bars are shown for the astrophysical values. The EBIT range 
splits into two parts: the upper range (singly hatched) is for measurements where 
only Fe xvii ions were considered; the lower range (cross hatched) was measured 
when various amounts of Fe XvI ions were co-mixed with Fe xv ions. In the 
latter case, Fe xvi lines blend with line 3D, increasing its apparent line strength. 
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Figure 2 | Time coincidence between the free-electron laser (FEL) pulse and 
detected X-ray fluorescence. The horizontal axis shows the FEL photon 
energy, the vertical axis shows the time after a trigger signal, which is provided 
approximately 1 j1s before the FEL fires. The colour code indicates the photon 
count rate per time and energy, from 0 (white) to 15 (red) in Hz us tev t. 
The FEL energy was continuously scanned (200 minutes) from approximately 
809 eV to 829 eV. Fluorescence is seen from both Fe xvi and Fe xvi, but it is 
observed only in a small time window following the FEL pulses. The red box 
(magnified in the inset) shows one of the resonances with the corresponding 
resolutions in time (180 ns) and photon energy (0.8 eV). The photon energy 
resolution is determined by the intrinsic energy resolution of the FEL. Because 
the radiative decay times of the transitions of interest are below a picosecond, all 
fluorescence photons resulting from a single, femtosecond LCLS pulse are 
registered as simultaneous by the high-purity Ge detector used to detect the 
fluorescence X-rays, and the time window is solely determined by the time 
resolution (180 ns) of the detector. By gating over a 600-ns-wide time region 
(indicated by the green lines), we included essentially all fluorescence photons, 
while suppressing the background (produced by the interaction of the target 
ions with the electron beam from FLASH-EBIT) by a factor of 14,000. The 
much larger number of photons detected outside this time window was used 
for background determination with very high statistical significance. The 
background corrected spectra produced following these procedures are 
shown in Fig. 3. 


column of intervening media*’. Targeted measurements, which used 
an electron beam to produce and probe the ions, have furthermore 
found line blends that can increase the apparent intensity of some of 
the Fexvu lines'®. Very importantly, such measurements have pro- 
duced absolute collisional excitation cross-sections'!, which, like the 
line intensities, have not been reproduced by atomic calculations at the 
level needed to model astrophysical spectra. Although such laboratory 
measurements have produced spectra that agree very well with those 
observed from celestial sources (see Fig. 1 inset), they have not yet 
pinpointed the fundamental reason why spectral models do not pro- 
vide the desired agreement with observations. 

The consensus among most atomic theorists has been that the 
remaining problem lies with the calculation of collisional effects, that 
is, with deficiencies in atomic scattering calculations of the electron- 
impact excitation cross-section, and special emphasis was therefore 
placed on correctly including resonances with the help of sophisticated 
R-matrix methods’*. Numerous resolutions of the problem have been 
proposed'*”, including the suggestion that kinetic contributions from 
other levels, which include radiative cascades, collisional depopulation 
and photon scattering, might need revision depending on the parti- 
cular plasma conditions'*. Another possibility’* is that the problem may 
lie in the incorrect or incomplete description of the mixing between 
wavefunctions of the same parity and total angular momentum (this 
mixing can also be viewed as an electron-electron correlation issue). 
The intensity ratio of the Fe xvii lines 3C and 3D (see Fig. 1) is the key to 
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Figure 3 | Measured X-ray fluorescence spectra. a, A typical fluorescence 
spectrum of Fe xvi and Fe xvi when both ion species are among the target ions; 
b, a typical fluorescence spectrum of Fe xvi alone, which is produced in the 
absence of Fe!®* target ions. Error bars, 1o uncertainties. Fe xvi lines are 
labelled A, B and C in red; the Fe xvu lines are labelled in green. Both spectra are 
normalized to the highest peak (line B). The blend of line 3D from Fe xv and 
line C from Fe xvi has to be taken into account when comparing the intensities 
of 3C and 3D. The plotted fluorescent photon yield has been adjusted for 
independently monitored variations in the FEL X-ray intensity as a function of 
FEL energy. 


resolving this controversy. Line 3D is forbidden in the Russell-Saunders 
or LS-coupling approach to constructing wavefunctions. It only exists 
because of mixing with 3C: the more their upper levels mix, the larger 
the radiative decay rate and thus the larger the oscillator strength of 3D 
at the expense of 3C. Because excitation cross-sections vary mono- 
tonically with the oscillator strength, as illustrated in Supplementary 
Information, the increased mixing results in a higher intensity of 3D 
relative to that of 3C. Mixing, of course, is not restricted to just these two 
levels, but occurs among all levels with the same parity and total angular 
momentum. Because there are an infinite number of atomic levels, 
theory must introduce approximations to this correlation issue when 
constructing a particular wavefunction. 

In a typical plasma, the intensity ratio of lines 3C and 3D not only 
reflects the mixing among the electronic levels but also depends on the 
above-mentioned kinetic contributions and on collisional effects, such 
as electron-impact excitation, resonance excitation, inner-shell ioniza- 
tion and dielectronic recombination, which need to be integrated over 
the plasma electron distribution function. Hence, the effect of quantum 
mechanical mixing cannot be cleanly isolated from the plasma envir- 
onment, which is described by such quantities as the temperature, 
density, flows, opacity, spatial gradients and temporal variations, so that 
the question of why the measured ratio does not agree with theory 
remains obscure and controversial. 

X-ray laser spectroscopy” provides a new experimental method to 
resolve this controversy because the excitation and fluorescence of the 
levels depend only on the atomic wavefunctions and not on electronic 
collisions. In other words, the intensity of the 3C and 3D lines excited 
by an X-ray laser is directly proportional to the oscillator strength of 
each line, providing a direct test of the accuracy of the (mixed) wave- 
functions that is totally unaffected by the plasma environment, col- 
lisional effects and atomic kinetics. For such a measurement we need a 
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Figure 4 | Predicted and measured intensity ratios of lines 3C and 3D. The 
experimental value and associated 1o uncertainty are shown as a green band. 
The red circles are values produced by wavefunctions used in typical scattering 
calculations'®*!”*. These wavefunctions are not optimized to account for 
mixing among more than a few electronic levels. The blue square indicates the 
ratio resulting from wavefunctions employed in a newer scattering calculation 
that employed more fully converged wavefunctions'*. The green diamonds 
illustrate the ratios resulting from wavefunctions generated with the MCDF 
ansatz (see Supplementary Information) at various levels of completeness (that 
is, convergence) of mixing among the electronic levels: mixing among nearly 
800 levels (a); mixing among nearly 90,000 levels (b); and extrapolation to 
mixing among “all’ (infinitely many) levels (c). The filled orange diamonds 
show the ratios arising from wavefunctions produced in calculations using the 
MBPT ansatz. This ansatz is designed to optimize the treatment of mixing 
among electronic levels'*** using perturbative methods. Perturbative 
calculations were carried out to second order only, but the resulting 
wavefunctions produce ratios of the oscillator strengths that are already lower 
than those produced by other methods. Higher-order MBPT calculations do 
not yet exist. The dark blue cross represents the value listed in National Institute 
for Standards and Technology (NIST) atomic database’. The assessed 
uncertainty (rated ‘C+’ (or 18%) and ‘D’ (or +50%) for the relevant Fe xvu 
lines) associated with the ratio is as large as 50%, which provides overlap with 
our measurement and enough room for future theoretical improvement using a 
new quantum mechanical ansatz. 


target of Fe'°* and ultrashort, very intense X-ray pulses capable of 


exciting sufficient numbers of iron ions for detection. The advent of 
a new kind of ultra-brilliant light source, the free-electron X-ray laser, 
coupled with a target of trapped, highly charged ions provided by 
an electron beam ion trap (EBIT)'*, now makes such measurements 
possible. 

The experiment was carried out at the Linac Coherent Light Source 
(LCLS)* free-electron X-ray laser, where the transportable FLASH- 
EBIT”’ was installed as an end-station in the Soft X-ray Materials 
Science (SXR) area*® (see Supplementary Information). Efficient 
detection of the X-ray-induced fluorescence, which took advantage 
of the femtosecond time structure of the LCLS photon beam, was 
the key to a successful measurement (Fig. 2). We performed LCLS 
photon energy scans at EBIT electron beam energies both above and 
below the threshold for the production of Fe'°* ions. Using this 
method, we obtained spectra of Fe xvi and Fe xvi together, as well 
as Fe xvi alone (Fig. 3), allowing us to take into account blending of 
Fe xvi and Fe xvul lines. 

The linear polarization of the LCLS photon beam causes aniso- 
tropic angular emission patterns for the different X-ray transitions. 
Therefore, the fluorescence detection efficiency is not the same for all 
the lines but depends on the detection geometry. Fortunately, lines 3C 
and 3D have identical emission patterns, as discussed in Supplemen- 
tary Information, precluding a possible source of uncertainty. The 
photon beam intensity monitor of the SXR beamline did not provide 
absolute photon fluxes. However, we accurately measured the fluo- 
rescence intensity ratio of 3C and 3D as 2.61 + 0.23, averaged over two 
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independent measurement periods (see Supplementary Information 
and Supplementary Fig. 2), after accounting for the blend of 3D with 
the Fe xvi line C. 

The measured fluorescence ratio can be directly compared to quan- 
tum mechanical calculations of the oscillator strengths of lines 3C and 
3D, as presented in Fig. 4. Computational limitations typically force 
scattering calculations to rely on some of the poorest approxima- 
tions for atomic wavefunctions, in which the mixing of the target 
wavefunctions may be poorly converged. Indeed, typical scattering 
calculations'®”’** have been based on wavefunctions that predict ratios 
of the oscillator strengths between 4.0 and 5.0, as summarized in Fig. 4. 
Some newer R-matrix scattering calculations’? have now specifically 
employed more refined methods to construct wavefunctions, which 
give a ratio as lowas 3.54. Indeed, focused atomic structure calculations 
designed to produce the best wavefunctions can produce even lower 
ratios of the oscillator strengths, depending on the quantum mecha- 
nical ansatz used. The multiconfiguration Dirac-Fock (MCDF) ansatz 
(Supplementary Fig. 2) produces a ratio that drops from 4.5, when 
including mixing among only a reasonable number of wavefunction, 
to 3.49 when essentially fully converged, that is, when including mixing 
among a very large number of wavefunctions and thus aiming to 
approximate the effect of including all levels. The many-body per- 
turbation theory (MBPT) ansatz gives 3.50 or even 3.43 in second order 
approximation'®” (Fig. 4). The MBPT ansatz might give an even lower 
ratio, if the calculation were carried out to higher order. However, such 
a refined MBPT calculation has not yet been done. In other words, our 
experiment intimates that quantum mechanics has reached a point 
where the dominant uncertainties lie in the wavefunctions themselves, 
and a refined MBPT calculation, or yet another different ansatz that 
includes not only a more complete calculation of correlation effects, 
but also accounts for quantum electrodynamical terms and the time 
dependence of atomic dynamics, may be needed to reproduce the 
experimental value. 

Future quantum mechanical calculations may bring the relative 
oscillator strengths, and thus the ratio of collisional cross-sections 
associated with them, into closer agreement with our measured value. 
This would bring the predicted Fe xvii intensity ratios into agreement 
with those measured in collisional plasmas in the laboratory and 
observed from X-ray sources in space. 
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Mid-infrared frequency comb based on a quantum 


cascade laser 


Andreas Hugi’, Gustavo Villares', Stephane Blaser’, H. C. Liu? & Jéréme Faist! 


Optical frequency combs’ act as rulers in the frequency domain 
and have opened new avenues in many fields such as fundamental 
time metrology, spectroscopy and frequency synthesis. In particu- 
lar, spectroscopy by means of optical frequency combs has sur- 
passed the precision and speed of Fourier spectrometers. Such a 
spectroscopy technique is especially relevant for the mid-infrared 
range, where the fundamental rotational-vibrational bands of 
most light molecules are found’. Most mid-infrared comb sources 
are based on down-conversion of near-infrared, mode-locked, 
ultrafast lasers using nonlinear crystals*. Their use in frequency 
comb spectroscopy applications has resulted in an unequalled 
combination of spectral coverage, resolution and sensitivity*”’. 
Another means of comb generation is pumping an ultrahigh- 
quality factor microresonator with a continuous-wave laser*”°. 
However, these combs depend on a chain of optical components, 
which limits their use. Therefore, to widen the spectroscopic appli- 
cations of such mid-infrared combs, a more direct and compact 
generation scheme, using electrical injection, is preferable. Here 
we present a compact, broadband, semiconductor frequency comb 
generator that operates in the mid-infrared. We demonstrate 
that the modes of a continuous-wave, free-running, broadband 
quantum cascade laser” are phase-locked. Combining mode pro- 
liferation based on four-wave mixing with gain provided by the 
quantum cascade laser leads to a phase relation similar to that of a 
frequency-modulated laser. The comb centre carrier wavelength 
is 7 micrometres. We identify a narrow drive current range with 
intermode beat linewidths narrower than 10 hertz. We find comb 
bandwidths of 4.4 per cent with an intermode stability of less than 
or equal to 200 hertz. The intermode beat can be varied over a 
frequency range of 65 kilohertz by radio-frequency injection. The 
large gain bandwidth and independent control over the carrier 
frequency offset and the mode spacing open the way to broadband, 
compact, all-solid-state mid-infrared spectrometers. 

Unlike the situation in interband semiconductor lasers!*, mode- 
locked operation of typical high-performance, room-temperature 
quantum cascade lasers with emission of short single pulses is difficult 
to achieve because the gain recovery time (t ~ 0.3 ps) is much shorter 
than the time required by the pulse to complete one round-trip in the 
cavity (the round-trip time, t,, = 64 ps), which defines the mode spa- 
cing. Such pulsed operation has so far only been achieved at cryogenic 
temperatures" over a locking bandwidth of 15 cm‘ at 77 K by incor- 
porating an active region that used a diagonal transition with a very 
long upper-state lifetime’*. Similarly, mode locking of ten modes has 
also been achieved at cryogenic temperature in terahertz quantum 
cascade lasers". It should be noted, however, that the only requirement 
for comb operation in spectroscopy applications is the periodicity of 
the waveform at the round-trip frequency, not the generation of high- 
intensity pulses. Even though microcavity-based frequency combs 
have shown ‘true’ comb operation with millihertz stability, their out- 
put is sometimes nearly unmodulated in time'®’”. In fact, because the 
intensity of a perfectly frequency-modulated laser is constant, the 


power envelope of such a beam would not be perturbed by the fast 
gain recovery of the quantum cascade laser and the spectrum of such a 
laser would also be composed of equally spaced and discrete spectral 
lines. Furthermore, as in radio communication, a frequency-modulated 
signal may be more robust to noise than an amplitude-modulated 
signal. In addition, a constant output power can be preferable for linear 
spectroscopy applications, to avoid additional complexities caused by 
nonlinear interactions. 

Nonlinear effects in quantum cascade lasers, triggering the onset of 
multimode operation, have been studied in detail'*. As in interband 
lasers’’, spatial hole burning was found to have a key role in the onset 
of multimode instabilities. Two counterpropagating waves at frequen- 
cies «, and «, create a grating that beats at a frequency dw = @, — @, 
transferring energy between modes separated by that frequency dif- 
ference. This gain mechanism is phase sensitive. In addition, intersub- 
band transitions feature strong third-order optical nonlinearities, ®, 
due to the large optical matrix element between the excited state and 
the empty lower states*” ~’, allowing parametric processes due to four- 
wave mixing. It is important to stress that because these processes are 
quasi-resonant, the coupling between the modes will occur with a non- 
trivial phase delay. As shown schematically in Fig. la, the result of 
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Figure 1 | Optical frequency comb in a quantum cascade laser. a, The 
dispersed Fabry—Pérot resonator modes lock to the equally spaced modes 
generated by four-wave mixing processes. b, Gain and GVD simulations of a 
narrowband (dotted) and a broadband (solid lines) quantum cascade laser. 
c, Magnified view of the laser spectra at J = 740 mA, measured using a high- 
resolution Fourier transform infrared spectrometer (0.0026 cm ', 78 MHz). 
a.u., arbitrary units. 
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these nonlinearities will be a proliferation of modes due to degenerate 
and non-degenerate four-wave mixing processes, in a similar fashion 
to what occurs in microcavity resonators’®. 

The ability to lock longitudinal modes is limited by the material 
group velocity dispersion (GVD). Our heterogeneous quantum cas- 
cade laser is expected to have a much lower GVD in comparison with 
conventional devices’. It operates close to the zero-GVD point of our 
device, a InGaAs/InAlAs heterostructure grown on InP; the low dop- 
ing of the active region and the waveguide design decrease the contri- 
bution from these components. In addition, as shown in Fig. 1b, and in 
contrast to conventional quantum cascade lasers, the computed GVD 
has a broad minimum in the centre of the gain curve owing to the “flat 
top’ shape of the gain curve of the broadband laser. 

The broadband quantum cascade laser used in this work was com- 
posed of three different substacks and operated at a wavelength of 
around 7 tm. The laser spectra partly displayed in Fig. 1c were mea- 
sured using a high-resolution Fourier transform infrared spectrometer 
and show a single set of longitudinal modes extending over 1.1 1m, 
with no measurable dispersion within the resolution (0.0026 cm}, 
78 MHz) of the spectrometer over the whole dynamic range (Sup- 
plementary Fig. 1). All instabilities related to multiple transverse modes 
can be completely neglected. A selected set of spectra indicating the 
power per mode (between 1)W and 3mW) is displayed in Sup- 
plementary Fig. 2a. The total output power as a function of current at 
—20 °C is shown in Supplementary Fig. 2b. The device switches to multi- 
mode operation at I = 460 mA, shortly above threshold (Jy, = 400 mA). 
The spectrum, which is initially spread over dw = 40 cm ', broadens 
progressively to dw = 230cm ~ at the maximum current. 
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Figure 2 | Intermode beat spectroscopy. a, Narrow intermode beat, with a 
full-width at half-maximum of <10 Hzat the onset of the multimode emission. 
The resolution bandwidth of the spectrum analyser is set to 10 Hz. b, Intensity 
interferogram and intermode beat interferograms measured in two successive 
scans separated by 90 min, at 490 mA. The resolution bandwidth is 100 kHz. 
QWIP, Quantum-well infrared photodetector. c, Comparison of the measured 
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To investigate to what extent the modes are coupled, we first mea- 
sure the intermode beat, that is, the radio-frequency spectrum of the 
intensity near the round-trip frequency, v,,, using a fast quantum-well 
infrared photodetector’. Uncoupled modes (such as those originating 
from two free-running lasers driven in a similar way to our device) 
would generate a spectrum with a 5-10-MHz linewidth”>. In contrast, 
as shown in Fig. 2a, such a measurement performed just above the 
onset of multimode operation shows a linewidth with a full-width at 
half-maximum of only dv = 8.8 Hz. For this current, the spectrum is 
already 5@ = 15cm‘ wide and contains 60 modes. This very narrow 
intermode beat is observed over the current range J ~ 460-480 mA. At 
higher temperatures (13 °C), this regime is more dominant and the 
comb spans up to 60cm | (240 modes) with an intermode beat line- 
width of =200Hz. At a current of =480mA, the intermode beat 
abruptly becomes wider: dv ~ 48 kHz. The intensity of the modulation 
of the laser output is never greater than 2%, indicating a mostly con- 
stant amplitude. The abrupt change in intermode beat linewidth is also 
reflected in the amplitude noise spectra shown in Supplementary Fig. 
3. When the current is increased to 500mA, the spectrum spans 
~100cm 7! and the intermode beat widens to 5v < 250 kHz. Above 
510mA, the single narrow intermode beat splits into two distinct 
broadened peaks spaced by 14 MHz. 

The assessment of the coherence properties of a mode-locked laser 
is usually performed by an autocorrelation measurement on a non- 
linear crystal to reveal the time structure of the optical pulse. This 
technique cannot be used in our case, because we do not expect the 
respective phases of the longitudinal modes to yield a single optical 
pulse with high peak intensity, suitable for nonlinear optics. To gain 
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delay. d, Comparison of the intermode beat spectrum, J(V1,@), and the 
intensity spectrum, 7(0,q), at an injection current of 500 mA. 
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insight into the respective phases and coherence properties of 
the modes, we present here an interferometric technique whereby 
the autocorrelation of the intermode beat is measured using a 
Michelson interferometer. The quantity measured is 


=|F (E(t) +E(t+7)|’)| (1) 


which is the absolute value of the Fourier component of the intensity at 
the detector at frequency v over a chosen resolution bandwidth of the 
spectrum analyser. Here E denotes the time-dependent amplitude of 
the electric field and F stands for the Fourier transform that is applied 
to the resulting intensity. The normal intensity interferogram mea- 
sured in a Fourier transform infrared spectrometer is I(0,t). In con- 
trast to the intensity interferogram, the intermode beat interferogram, 
(VT), is a function of both the amplitude and the relative phase of 
the modes. Furthermore, in analogy with Fourier spectroscopy, the 
Fourier transform 


I (Vrt5@) =F(I(vyt,T)) (2) 


will yield the intermode beat spectrum, which roughly indicates the 
spectral regions contributing to the intermode beat. 

Figure 2b shows two successive scans of I(v,,7) acquired with a 
resolution bandwidth of 100 kHz at the round-trip frequency, mea- 
sured at 90-min intervals and an injection current of 490 mA. These 
scans (blue and red) are compared with the zero-frequency compon- 
ent J(0, t) (green). Randomly varying phases would generate an inter- 
mode beat interferogram roughly proportional to the zero-frequency 
component around zero time delay (Supplementary Fig. 4). In con- 
trast, the measured I(v,,, 7) shows long-term stability with a minimum 
at zero time delay, proving that there is a well-defined phase relation 
between the modes. The phase relation cannot be easily inferred from 
the measurement of I(v,;, tT), however. To help us do so, we compare 
the measured I(v,,,7) to the respective signals predicted for a purely 
frequency-modulated signal and a conventional mode-locked laser 
(Fig. 2c; see Supplementary Fig. 5 for a measurement of an intermode 
beat interferogram of a mode-locked fibre laser). The phases are such 
that the amplitude modulation of the signal is strongly damped at 
zero delay (t = 0), as it would be for a frequency-modulated laser. In 
general, the comparison emphasizes the frequency modulation char- 
acteristics of the output of the laser, as expected from the short lifetime 
of the excited state. A minimum of the intermode beat is always 
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observed for t = 0 over all injected currents, indicating that the de- 
vice never produces single pulses. To confirm further the frequency- 
modulated nature of the laser, we put a sheet of polyethylene, which 
has strong wavelength-dependent absorption, between the laser and 
the detector. This sheet acts as an optical discriminator that can 
convert the frequency modulation of the laser output to an amplitude 
modulation signal. As expected from a frequency-modulated signal, 
we find that the amplitude of the intermode beat at the detector 
increases. This amplification factor was measured to be 10-18. 

The intermode beat spectrum, 7(v;,@), and the intensity spec- 
trum, 7(0,@), are compared in Fig. 2d and show that the entire laser 
spectrum contributes to the intermode beat. Regardless of the kilohertz 
intermode beat linewidth, we have a comb-like spectrum because the 
mode spacing does not fluctuate more than the chosen resolution 
bandwidth of the spectrum analyser over the whole laser spectrum. 
The measurement of the whole radio-frequency spectrum around v,; 
as a function of t yields important information about the mechanisms 
that destabilize the comb. An example of such characteristics is shown 
in Supplementary Fig. 6a at a high current (I= 679 mA) where the 
intermode beat has two peaks and a wide radio-frequency spectrum. 
By further increasing the size of the region with low GVD, it should be 
straightforward to increase the comb bandwidth. 

To characterize the comb further, we did a heterodyne beat experi- 
ment between a comb line and a single-mode distributed-feedback 
quantum cascade laser (Supplementary Fig. 7a). The inset of Fig. 3a 
shows the spectrum of the single-mode quantum cascade laser along- 
side the comb spectrum that covers 60cm ' and has an intermode 
beat linewidth of =200 Hz. Figure 3a shows the corresponding hete- 
rodyne signal. A narrow linewidth of 1.3 MHz is measured, which 
simply reflects the temperature and current stability of our set-up. 
The increased phase noise directly translates into a broader linewidth 
of about 20 MHz in comb-like spectra. 

Our measurements show that the low GVD, as well as the existence 
of nonlinearities, tends to lock the modes together. This locking can 
also be enhanced through radio-frequency injection. Such injection 
locking has already been successfully applied to a terahertz quantum 
cascade laser, enabling the demonstration of pulsed mode-locked ope- 
ration. In our experiment, a radio-frequency power of 11 dBm is in- 
jected close to the round-trip frequency of 7.5 GHz, with an estimated 
injection loss of —34.5 dB. The direct current is set to 494 mA. While 
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Figure 3 | Linewidth measurement and radio-frequency injection. 

a, Heterodyne beat measurement between a single-mode distributed-feedback 
(DFB) quantum cascade laser and a quantum-cascade-laser comb line. The 
inset shows the two spectra. The measured heterodyne signal has a linewidth of 
1.3 MHz. The resolution bandwidth is 500 kHz and the integration time is 

25 ms. b, Heterodyne beat measurement of a comb-like line and a stable 
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single-mode quantum cascade laser while sweeping the radio-frequency (RF) 
injection frequency around the cavity round-trip frequency. When sweeping 
across the resonance of the cavity round-trip, the heterodyne signal can be 
shifted in frequency by 11 MHz. Inset, measured intermode beat at three 
different characteristic injection frequencies at resonance. 
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sweeping the radio frequency, we measure the heterodyne beat 
between one comb-like line and the single-mode quantum cascade 
laser (Supplementary Fig. 7a). Figure 3b shows the change in the 
heterodyne beat position and proves that we are able to change the 
mode spacing actively. The intermode beat with radio-frequency injec- 
tion around resonance is shown in the inset of Fig. 3b. Unlike previous 
attempts’, our measurements show the expected collapse of the inter- 
mode beat at the round-trip frequency to the resolution of the spec- 
trum analyser. The round-trip frequency can be varied over a range of 
more than 65 kHz. There are approximately 167 modes between the 
peak of the laser and the mode that beats with the single-mode quan- 
tum cascade laser. Changing the round-trip frequency by 65 kHz 
should therefore result in a change of 167 X 65 kHz = 10.9 MHz in 
the heterodyne beat signal, which agrees with the measured 11 MHz. 
It should be noted that the carrier frequency offset can be adjusted by 
the injected direct current (Supplementary Fig. 7b), whereas the mode 
spacing can be set through the radio-frequency injection. Therefore, it 
is possible to stabilize quantum-cascade-laser frequency combs fully”’. 
However, as the injected radio-frequency power is increased to 
26.6 dBm, the intermode beat interferogram resembles that of a fun- 
damentally mode-locked laser, and, unlike the case for 11 dBm, the 
intermode beat spectrum shows that only a small fraction (18%) of 
the modes contribute to the injection-locked intermode beat (Sup- 
plementary Fig. 8). 

We have presented a mid-infrared quantum-cascade-laser free- 
running optical frequency comb covering 60 cm (308 nm) at a cen- 
tre wavenumber of 1,430 cm‘ (7 tm). The measured linewidth of the 
comb lines is =1.3 MHz with a narrow intermode beat linewidth of 
=200 Hz. At higher currents, we measure comb-like emission spectra 
that span 100 cm‘ and have broader intermode beat linewidths (50- 
250 kHz). Further measurements using metrological techniques will 
give deeper insight into this system”*. We have shown that the carrier- 
envelope offset frequency as well as the repetition frequency can be 
set independently by changing the direct current and through radio- 
frequency injection. The phase signature of the comb strongly resem- 
bles that of a frequency-modulated signal and is stable over time. We 
identify the locking mechanism responsible for the formation of the 
frequency comb to be four-wave mixing. Key physical properties of 
materials used in quantum cascade lasers in frequency combs are 
therefore strong third-order optical nonlinearities and a low GVD. 
Further investigation of the GVD and its sign is necessary”. Al- 
together, this system offers remarkable freedom of design, because it 
allows control over the electronic transitions, gain flatness, optical 
nonlinearities and dispersion through band structure engineering. 
The bandwidth of mid-infrared quantum-cascade-laser frequency 
combs can be improved through a detailed understanding of the 
phase-locking mechanism and by appropriately engineering the 
GVD using waveguide design and facet coatings. The large bandwidth, 
in combination with independent control over the carrier frequency 
offset and the mode spacing, opens the way to metrological and sens- 
ing applications. Thus, the mid-infrared quantum-cascade-laser fre- 
quency comb could become standard technology for broadband, 
compact, all-solid-state mid-infrared spectrometers. 


METHODS SUMMARY 

Calculation of the GVD. We include three contributions to the GVD in our 
calculations. First, the vertical and lateral modal GVDs of the guided mode 
(GVD ert and GVDjat). GVDyert includes the dispersion of the doped layers due 
to free-carrier absorption and material dispersion. Second, the GVD due to the 
gain and loss in the active region. These produce a change in the refractive index, 
An», calculated through Kramers-Kronig relations. The gain is calculated using 
software based on a density matrix model extended to the whole structure, which 
also includes second-order gain and loss mechanism between all sub-band pairs”. 
The total losses are set to 2.5cm~'. Because the broadband active region is com- 
posed of three substacks, each individual stack is simulated again including the loss 
introduced through cross-absorption. This takes into account the inhomogeneous 
broadening of these devices. Third, we account for the GVD due to material 
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dispersion. The total GVD is calculated as a sum of the individual terms weighted 
by the modal overlap. 

Intermode beat spectroscopy set-up. Supplementary Fig. 9 shows the intermode 
beat spectroscopy set-up. The laser light is collimated through an antireflection- 
coated ZnSe lens. The light passes through an antireflection-coated quarter-wave 
plate to reduce the effect of feedback originating from reflections from optical 
components. The light passes through a Fourier transform infrared spectrometer 
(Bruker IFS66/S) and is focused through an antireflection-coated ZnSe lens on a 
80 um X 80 um quantum-well infrared photodetector cooled to the temperature 
of liquid nitrogen. The Fourier transform infrared spectrometer is run in step-scan 
mode. At each step, we record the radio-frequency spectrum at the round-trip 
frequency of the laser. Concurrently, we record the intensity component of the 
interferogram through the bias current of the photodetector. 


Received 25 May; accepted 25 September 2012. 


1. Uder, T., Holzwarth, R. & Hansch, T. W. Optical frequency metrology. Nature 416, 
233-237 (2002). 

2. Keilmann, F., Gohle, C. & Holzwarth, R. Time-domain mid-infrared frequency- 
comb spectrometer. Opt. Lett. 29, 1542-1544 (2004). 

3. Adler, F. etal. Phase-stabilized, 1.5 W frequency comb at 2.8-4.8 um. Opt. Lett 34, 
1330-1332 (2009). 

4. Johnson, T.A.& Diddams, S. A. Mid-infrared upconversion spectroscopy based on 
a Yb:fiber femtosecond laser. Appl. Phys. B 107, 31-39 (2012). 

5. Foltynowicz, A., Ban, T., Maslowski, P., Adler, F. & Ye, J. Quantum-noise-limited 
optical frequency comb spectroscopy. Phys. Rev. Lett. 107, 233002 (2011). 

6. Adler, F., Thorpe, M.J., Cossel, K.C. & Ye, J. Cavity-enhanced direct frequency comb 
spectroscopy: technology and applications. Annu. Rev. Anal. Chem. 3, 175-205 
(2010). 

7. Vodopyanov, K.L., Sorokin, E., Sorokina, |. T. & Schunemann, P. G. Mid-IR frequency 
comb source spanning 4.4—5.4 um based on subharmonic GaAs optical 
parametric oscillator. Opt. Lett. 36, 2275-2277 (2011). 

8. Del’Haye, P. et al. Optical frequency comb generation from a monolithic 
microresonator. Nature 450, 1214-1217 (2007). 

9. Wang, C. Y. et al. Mid-infrared optical frequency combs based on crystalline 
microresonators. Preprint at http://arxiv.org/abs/1109.2716 (2011). 

10. Kippenberg, T. J., Holzwarth, R. & Diddams, S. A. Microresonator-based optical 

frequency combs. Science 332, 555-559 (2011). 

11. Faist, J. et al. Quantum cascade laser. Science 264, 553-556 (1994). 

12. Derickson, D. J. et al. Short pulse generation using multisegment mode-locked 

semiconductor lasers. IEEE J. Quantum Electron. 28, 2186-2202 (1992). 

13. Wang, C. Y. etal. Mode-locked pulses from mid-infrared quantum cascade lasers. 

Opt. Express 17, 12929-12943 (2009). 

14. Faist, J. et al. Laser action by tuning the oscillator strength. Nature 387, 777-782 

(1997). 

15. Barbieri, S. et al. Coherent sampling of active mode-locked terahertz quantum 
cascade lasers and frequency synthesis. Nature Photon. 5, 306-313 (2011). 

6. Del’Haye, P., Arcizet, O., Schliesser, A. Holzwarth, R. & Kippenberg, T. Full 
stabilization of a microresonator-based optical frequency comb. Phys. Rev. Lett. 
101, 053903 (2008). 

7. Ferdous, F. et al. Spectral line-by-line pulse shaping of on-chip microresonator 
frequency combs. Nature Photon. 5, 770-776 (2011). 

8. Gordon, A. et al. Multimode regimes in quantum cascade lasers: from coherent 
instabilities to spatial hole burning. Phys. Rev. A 77, 053804 (2008). 

9. Yamada, M. Theoretical analysis of nonlinear optical phenomena taking into 
account the beating vibration of the electron density in semiconductor lasers. 

J. Appl. Phys. 66, 81-89 (1989). 

20. Walrod, D., Auyang, S. Y., Wolff, P. A. & Sugimoto, M. Observation of third order 
optical nonlinearity due to intersubband transitions in AlGaAs/GaAs superlattices. 
Appl. Phys. Lett. 59, 2932-2934 (1991). 

21. Rosencher, E. et a/. Quantum engineering of optical nonlinearities. Science 271, 
168-173 (1996). 

22. Banerjee, S. & Shore, K. A. MIR and NIR nonlinear optical processing using 
intersubband 7 in triple quantum well structures. Semicond. Sci, Technol. 18, 
655-660 (2003). 

23. Parz, W. et al. Intersubband gain-induced dispersion. Opt. Lett 34, 208-210 
(2009). 

24. Grant, P. D., Dudek, R., Buchanan, M. & Liu, H.C. Room-temperature heterodyne 
detection up to 110 GHz with a quantum-well infrared photodetector. /EEE Photon. 
Technol. Lett. 18, 2218-2220 (2006). 

25. Aellen, T. eta/. Direct measurement of the linewidth enhancement factor by optical 
heterodyning of an amplitude-modulated quantum cascade laser. Appl. Phys. Lett 
89, 091121 (2006). 

26. Soibel, A. et al. Active mode locking of broadband quantum cascade lasers. /EEE J. 
Quantum Electron. 40, 844-851 (2004). 

27. Telle, H. et al. Carrier-envelope offset phase control: a novel concept for absolute 
optical frequency measurement and ultrashort pulse generation. Appi. Phys. B 69, 
327-332 (1999). 

28. Braje, D., Hollberg, L. & Diddams, S. Brillouin-enhanced hyperparametric 
generation of an optical frequency comb in a monolithic highly nonlinear fiber 
cavity pumped by a cw laser. Phys. Rev. Lett. 102, 193902 (2009). 

29. Herr, T. et a/. Universal formation dynamics and noise of Kerr-frequency combs in 
microresonators. Nature Photon. 6, 480-487 (2012). 


©2012 Macmillan Publishers Limited. All rights reserved 


30. Terazzi, R. & Faist, J. A density matrix model of transport and radiation in quantum 
cascade lasers. N. J. Phys. 12, 033045 (2010). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank M. Quack and E. Miloglyadov for their help with the 
high-resolution Fourier transform infrared spectrometer measurements, U. Keller and 
V. Wittwer for providing scientific equipment, and J. Khurgin for discussions. This work 
was financially supported by the Quantum Photonics National Center of Competence 
in Research of the Swiss National Science Foundation. H.C.L. acknowledges support 
from the National Major Basic Research Projects (201 1CB925603) and the Shanghai 
Municipal Major Basic Research Project (09DJ1400102). 


LETTER 


Author Contributions A.H. carried out the measurements. Simulations and ideas were 
developed by A.H. and J.F. G.V. contributed to the heterodyne beat measurements as 
well as the amplitude noise measurements. S.B. provided both quantum cascade 
lasers. H.C.L. provided the quantum-well infrared photodetector. All the work was done 
under the supervision of J.F. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to J.F. (jfaist@ethz.ch) or H.C.L. 
(h.c.liu@sjtu.edu.cn). 


13 DECEMBER 2012 | VOL 492 | NATURE | 233 


©2012 Macmillan Publishers Limited. All rights reserved 


1 sid al Be 


doi:10.1038/nature11687 


Highly efficient organic light-emitting diodes from 


delayed fluorescence 


Hiroki Uoyama', Kenichi Goushi!?, Katsuyuki Shizu', Hiroko Nomura! & Chihaya Adachi)? 


The inherent flexibility afforded by molecular design has accele- 
rated the development of a wide variety of organic semiconductors 
over the past two decades. In particular, great advances have been 
made in the development of materials for organic light-emitting 
diodes (OLEDs), from early devices based on fluorescent mole- 
cules’ to those using phosphorescent molecules”*. In OLEDs, elec- 
trically injected charge carriers recombine to form singlet and 
triplet excitons in a 1:3 ratio’; the use of phosphorescent metal- 
organic complexes exploits the normally non-radiative triplet exci- 
tons and so enhances the overall electroluminescence efficiency””*. 
Here we report a class of metal-free organic electroluminescent 
molecules in which the energy gap between the singlet and triplet 
excited states is minimized by design*, thereby promoting highly 
efficient spin up-conversion from non-radiative triplet states to 
radiative singlet states while maintaining high radiative decay rates, 
of more than 10° decays per second. In other words, these mole- 
cules harness both singlet and triplet excitons for light emission 
through fluorescence decay channels, leading to an intrinsic fluor- 
escence efficiency in excess of 90 per cent and a very high external 
electroluminescence efficiency, of more than 19 per cent, which is 
comparable to that achieved in high-efficiency phosphorescence- 
based OLEDs’. 

The recombination of holes and electrons can produce light, in a 
process referred to as electroluminescence. Electroluminescence in 
organic materials was first discovered in 1953 using a cellulose film 
doped with acridine orange’, and was developed in 1963 using an 
anthracene single crystal connected to high-field carrier injection elec- 
trodes’. Electrical charge carriers of both polarities were injected into 
the organic layers, and the subsequent carrier transport and recom- 
bination produced blue electroluminescence originating from singlet 
excitons; that is, fluorescence. According to spin statistics, carrier 
recombination is expected to produce singlet and triplet excitons in 
a 1:3 ratio®’, and this ratio has been examined for many molecular 
systems*"*. The singlet excitons produced decay rapidly, yielding 
prompt electroluminescence (fluorescence). Two triplet excitons can 
combine to form a singlet exciton through triplet-triplet annihilation, 
which results in delayed electroluminescence (delayed fluorescence). 
Direct radiative decay of triplet excitons results in phosphorescence, 
but usually occurs only at very low temperatures in conventional 
organic aromatic compounds. The first demonstration of phospho- 
rescent electroluminescence using ketocoumarin derivatives in 1990’. 
However, the very faint electroluminescence was observed only at 
77 K, and with difficulty, and was assumed to be virtually useless even 
if included in rare-earth complexes, which should also involve both 
singlet and triplet excitons in electrical excitation’. In 1999, efficient 
electrophosphorescence was first demonstrated using iridium phenyl- 
pyridine complexes that achieve an efficient radiative decay rate of 
~10°s | by taking advantage of the strong spin-orbit coupling of 
iridium’. An internal electroluminescence efficiency of almost 100% 
was achieved’, providing convincing evidence that OLED technology 
can be useful for display and lighting applications. 


In the work reported here, we used a novel pathway to attain the 
greatest possible electroluminescence efficiency from simple aromatic 
compounds that exhibit efficient thermally activated delayed fluo- 
rescence (TADF) with high photoluminescence efficiency. Figure la 
shows the energy diagram ofa conventional organic molecule, depicting 
singlet (S,) and triplet (T,) excited states and a ground state (Sp). It was 
previously assumed that the S; level was considerably higher in energy 
than the T, level, by 0.5-1.0eV, because of the electron exchange 
energy between these levels. However, we found that careful design 
of organic molecules can lead to a small energy gap (AEs) between S, 
and T; levels*'*. Correspondingly, a molecule with efficient TADF 
requires a very small AEsy between its S; and T, excited states, which 
enhances T,— S, reverse intersystem crossing (ISC). Such excited 
states are attainable by intramolecular charge transfer within systems 
containing spatially separated donor and acceptor moieties*. The cri- 
tical point of this molecular design is the combination of a small AEgr, 
of < 100 meV, with a reasonable radiative decay rate, of > 10°s |, to 
overcome competitive non-radiative decay pathways, leading to highly 
luminescent TADF materials. Because these two properties conflict 
with each other, the overlap of the highest occupied molecular orbital 
and the lowest unoccupied molecular orbital needs to be carefully 
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Figure 1 | Energy diagram and molecular structures of CDCBs. a, Energy 
diagram of a conventional organic molecule. b, Molecular structures of CDCBs. 
Me, methyl; Ph, phenyl. 
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balanced. Furthermore, to enhance the photoluminescence efficiency 
of a TADF material, the geometrical change in molecular conforma- 
tion between its Sp and S, states should be restrained to suppress non- 
radiative decay. Limited orbital overlap generally results in virtually no 
emission, as has been shown in benzophenone derivatives. Therefore, 
it was previously assumed that a high photoluminescence efficiency 
could never be obtained from molecules with a small AEs;. Here we 
demonstrate that it is possible to realize a high photoluminescence 
efficiency and a small AEs; simultaneously. 

We designed a series of highly efficient TADF emitters based on 
carbazolyl dicyanobenzene (CDCB), with carbazole as a donor and 
dicyanobenzene as an electron acceptor (Fig. 1b). Because the carba- 
zolyl unit is markedly distorted from the dicyanobenzene plane by 
steric hindrance, the highest occupied molecular orbital and the lowest 
unoccupied molecular orbital of these emitters are localized on the 
donor and acceptor moieties, respectively, leading to a small AEgr. 
Moreover, the dicyanobenzene and carbazolyl groups are important 
in obtaining a high photoluminescence efficiency and various emis- 
sion colours, respectively. Dicyanobenzene derivatives are known to 
alter their chemical bonds in excited states, changing their electronic 
properties'®. Density functional theory (DFT) calculations predicted 
that using CDCBs would have the following advantages. The cyano 
groups suppress both non-radiative deactivation and changes in the 
geometries of the S, and T, states of CDCBs, leading to a high quantum 
efficiency. Conversely, the emission wavelengths of CDCBs should be 
easy to tune by changing the electron-donating ability of the peripheral 
groups, which can be altered by changing the number of carbazolyl 
groups or introduced substituents. Such molecular design should 
allow us to achieve not only highly efficient TADF but also a wide 
range of emission colours. 

We synthesized CDCBs from commercially available starting 
materials in a one-step reaction. Palladium or other rare-earth-metal 
catalysts were not required, making CDCBs cost effective. Nucleo- 
philic aromatic substitution of carbazole anions generated by treat- 
ment with NaH and dicyanobenzenes at room temperature (300 K) 
yielded the CDCBs. All CDCBs were obtained in high yields of >79%, 
except 4CzPN and 2CzPN, which were obtained in lower yield (38% 
and 9%, respectively) because of purification problems. CDCBs 
were fully characterized by NMR and infrared spectroscopy, high- 
resolution mass spectrometry and elemental analysis (Methods). 
They had high thermal stability; for example, in thermogravimetric 
analysis measurements under nitrogen-flow conditions, 4CzIPN 
began to sublime at around 450 °C before decomposing. 

Ultraviolet-visible absorption and photoluminescence spectra of 
4CZzIPN in toluene are presented in Fig. 2a. 4CzIPN has intense green 
emission with a maximum at 507 nm and a high photoluminescence 
quantum yield (®) of 94+ 2%. The Stokes shift of 4CzIPN is very 
small; generally, the emission produced by intramolecular charge 
transfer between donor and accepter units shows a large Stokes 
shift’. Under nitrogen-saturated conditions, the delayed component 
(TADF), which had a lifetime of t ~ 5.1 + 0.5 Ls, was more than two 
orders of magnitude longer than the prompt component, for which 
t =~ 17.8 + 1 ns (Supplementary Information). To determine whether 
the T, state was involved in luminescence, the transient photolu- 
minescence and photoluminescence quantum yield of 4CzIPN were 
measured in toluene under an oxygen atmosphere. When oxygen was 
bubbled through the solution of 4CzIPN in toluene for 10 min, the 
lifetime of the delayed component became very short (t ~ 91 + 3 ns), 
the prompt component became t ~ 6.9 + 0.5ns and @ decreased to 
10%. These results suggest that 4CzIPN is a TADF material because the 
delayed fluorescence was substantially quenched by oxygen’. 

The geometry of the So state of 4CzIPN in the gas phase was opti- 
mized using DFT and the 6-31G(d) basis set'*. Geometry optimiza- 
tions of the S, and T, states were carried out using time-dependent 
DFT. All ab initio calculations were performed using Gaussian 09 
software’? without symmetry constraints. For 4CzIPN, generalized 
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Figure 2 | Photoluminescence characteristics of 4CzIPN. a, Ultraviolet- 
visible absorption and photoluminescence spectra of 4CzIPN in toluene at a 
concentration of 10-° moll” !.a.u., arbitrary units. b, c, Highest occupied NTO 
(b) and lowest unoccupied NTO (c) according to the results of time-dependent 
DFT for the S, state of 4CzIPN using the optimized structure of the So state. 
Colours indicate the different phases of the natural transition orbitals. 

d, Potential energy surfaces, vertical transition energies (AEs@S, and AE;@S,) 
and relaxation energies (As and /s*) for the S; state. 


gradient approximation functionals underestimated AE¢y (the experi- 
mental value is 83 meV), whereas long-range corrected functionals 
overestimated it (Supplementary Information). The M06-2X func- 
tional”, a hybrid meta-generalized gradient-approximation functional, 
yields an intermediate value for AEs; and, moreover, by taking into 
account solvent effects using the polarizable continuum model’’”’, 
predicts an emission wavelength of 460 nm, which is reasonably close 
to the experimentally obtained photoluminescence peak at 507 nm in 
toluene solution. In the following discussion, we use the results cal- 
culated for M06-2X/6-31G(d). The highest occupied and lowest 
unoccupied natural transition orbitals”? (NTOs) based on the results 
of time-dependent DFT for the S, state of 4CzIPN using the optimized 
structure of the Sp state are depicted in Fig. 2b and Fig. 2c, respectively. 
The highest occupied NTO is delocalized over the four carbazolyl 
moieties, whereas the lowest unoccupied NTO is centred on the dicya- 
nobenzene moiety, suggesting that the S, state has charge transfer 
character; the carbazolyl groups act as electron donors and the 
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dicyanobenzene group acts as an electron acceptor. Steric hindrance 
between the carbazolyl and dicyanobenzene moieties causes a large 
dihedral angle of about 60° between the planes of the carbazolyl 
and dicyanobenzene groups. Consequently, the highest occupied and 
lowest unoccupied NTOs are spatially well separated, leading to a small 
AEsy and enhanced T — S, reverse ISC. 

As stated above, the Stokes shift of the S, state of 4CzIPN is small 
compared with those for typical charge transfer states, suggesting that 
the S,<—So excitation results in limited relaxation of the molecular 
geometry. We believe that this small Stokes shift is partly related to 
the presence of the cyano groups. To investigate the effect of the cyano 
groups on the geometry relaxation of the S, and T; states of 4CzIPN, we 
evaluated vertical transition energies (AEs@Sp and AEs@S,) and relaxa- 
tion energies (A; and A,*) for the S, state (Fig. 2d), and compared them 
with those of the molecule in which the two cyano groups of 4CzIPN are 
replaced with hydrogen atoms (4CzBz; Supplementary Information). 
The reorganization energies As for 4CzIPN and 4CzBz were calculated 
to be 0.27 and 0.83 eV, respectively. This energy is greatly reduced by 
introducing cyano groups into the electron-accepting unit. Because As 
represents the degree of geometry relaxation of the S; state to the So 
state, this result suggests that the cyano groups are important in sup- 
pressing geometry relaxation in the fluorescent state of 4CzIPN. In 
addition, A;* is also reduced by the presence of cyano groups. 
Torsional angles of the carbazolyl groups are calculated to be small in 
the presence of the cyano groups. This limited torsional flexibility can be 
a major factor in reducing the non-radioactive decay of 4CzIPN. Like 
that for the S, state, the relaxation energy for the T, state (7) is mark- 
edly reduced by the cyano groups. Thus, it is probable that the cyano 
groups suppress non-radiative deactivation from the S, and T, states, 
leading to the high photoluminescence quantum efficiency of 4CzIPN. 

Photoluminescence spectra of the CDCBs in toluene are presented 
in Fig. 3. The series of CDCBs yielded a wide range of emission colours 
ranging from sky blue (473 nm) to orange (577 nm). The emission 
wavelength depends on the electron-donating and -accepting abilities 
of the peripheral carbazolyl groups and the central dicyanobenzene 
unit, respectively. Introduction of methyl or phenyl substituents at the 
3- and 6- positions of the carbazolyl groups of 4CzTPN induces a shift 
of the emission maximum to longer wavelengths. Conversely, in the 
case of 2CzPN, the presence of fewer carbazolyl groups reduces its 
electron-donating ability and produces a shift of the emission maxi- 
mum to shorter wavelengths. We measured the photoluminescence 
quantum yield and transient photoluminescence of CDCBs in toluene 
under a nitrogen atmosphere, and are summarized in Supplementary 
Information. For 4CzPN and 4CzTPN @ is high (7443% and 
72 + 3%, respectively), whereas for 4CzTPN-Me, 4CzTPN-Ph and 
2CzZPN it is lower (47+ 2%, 26+ 1% and 47 + 2%, respectively) 
because of substituent effects or fewer carbazolyl groups. Because 
the transient photoluminescence of all CDCBs showed both a nano- 
second-scale prompt component and a microsecond-scale delayed 
component, the CDCBs were confirmed to be TADF materials. 

Figure 4a shows the photoluminescence decay curves for emission 
of 4CzIPN at 100, 200 and 300K ina 5+1wt% 4CzIPN:4,4’-bis 
(carbazol-9-yl)biphenyl (CBP) film. The triplet excitons of 4CzIPN 
are well confined using a CBP host because the T, state of CBP is 
higher in energy than the S; state of 4CzIPN. In addition, the fluo- 
rescence of CBP is completely quenched by efficient energy transfer 
between the guest and host molecules. The intense emission observed 
around t = 0s corresponds to the prompt component, and the long tail 
is the delayed fluorescence. The prompt component is assigned to the 
fluorescence of 4CzIPN. The delayed component is attributed to 
delayed fluorescence occurring via reverse ISC, that is, TADF, because 
the photoluminescence spectrum of the delayed fluorescence is iden- 
tical to that of the prompt fluorescence (Fig. 4b). 

Figure 4c shows the temperature dependence of @ for the prompt 
and delayed components of the film. The two components were 
resolved by combining the absolute value of ® estimated using an 
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Figure 3 | Photoluminescence of the CDCB series. a, Photoluminescence 
spectra measured in toluene. b, Photograph under irradiation at 365 nm. 


integrated-sphere photoluminescence measurement system and the 
temperature dependence of the photoluminescence decay curves 
(Supplementary Fig. 2). The prompt component increases very slightly 
as the temperature decreases, indicating the suppression of non- 
radiative decay from the S, state. Conversely, the delayed component 
decreases monotonically as the temperature decreases because reverse 
ISC becomes the rate-determining step, similar to the temperature 
dependence of tin 1v fluoride/porphyrin complexes, which are typical 
TADF emitters'*. At room temperature (300 K), a high @ value, of 
83 + 2%, was observed. To evaluate AEs; quantitatively, we estimated 
the activation energy of the reverse ISC rate constant (kpisc) from 
exp(—AEsq/kgT), where kg is the Boltzmann constant and T is tem- 
perature. This rate constant can be estimated from experimentally 
determined rate constants and the ® values of the prompt and delayed 
components at each temperature using™* 


kpisc = —— (1) 


where k, and kg are the rate constants of the prompt and delayed 
fluorescence components, respectively; kjgc is the ISC rate constant 
from S, to T, states; and ®, and ®, are the photoluminescence 
quantum yields of the prompt and delayed components, respectively. 
In Fig. 4d, the values of kysc calculated from equation (1), assuming 
that kjgc was independent of temperature, are plotted against 1/T for 
T = 200-300K. From the Arrhenius plot (Fig. 4d), we estimate an 
activation energy of 83 meV. Therefore, kpjsc would be suppressed 
considerably at low temperatures. However, even at low temperatures, 
the ® value of the delayed component is still high, at >40%, implying 
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Figure 4 | Temperature dependence of photoluminescence characteristics 
of a5 + 1 wt% 4CzIPN:CBP film. a, Photoluminescence decay curves of a 
6 wt% 4CzIPN:CBP film at 300 K (black line), 200 K (red line) and 100 K (blue 
line). The photoluminescence decay curves show integrated 4CzIPN emission. 
The excitation wavelength of the films was 337 nm. b, Photoluminescence 
spectrum resolved into prompt and delayed components. c, Temperature 
dependence of photoluminescence quantum efficiencies (errors are within 2%) 
for combined (prompt plus delayed; black squares), prompt (red circles) and 
delayed (blue triangles) components of 4CzIPN emission for 5 + 1 wt% 
4CzIPN:CBP film. The straight lines are guides for the eye. d, Arrhenius plot of 
the reverse ISC rate from the triplet state to the singlet state of 4CzIPN with kisc 
set to 4X 10’s_'. The straight line (least-squares regression) is used to 
determine the activation energy. The In(kpisc) errors are within 0.2. 


that kpisc is higher than the non-radiative rate constant of the T, state. 
It should be noted that for temperatures less than 200K the decay 
curves do not agree with the double-exponential decay model, but 
are described well by a multi-exponential decay model. This can be 
explained by the widened AEsy distribution caused by inhomogeneous 
molecular environments at lower temperatures. 

We then evaluated the performance of OLEDs containing the 
CDCB derivatives 4CzIPN (green emission), 4CzTPN-Ph (orange 
emission) and 2CzPN (sky-blue emission) as emitters. Figure 5 shows 
the external electroluminescence quantum efficiency of OLEDs con- 
taining the CDCB derivatives. To achieve high electroluminescence 
efficiency in these OLEDs, the T, state of the CDCB derivatives must 
be confined using a host material with a higher triplet energy level. 
Therefore, we used CBP as a host material in the green and orange 
OLEDs and used 2,8-bis(diphenylphosphoryl)dibenzo|b,d]thiophene 
(PPT) as a host material in the sky-blue OLED. The structures of the 
OLEDs were composed of multiple layers of indium tin oxide (ITO), 
4,4-bis[ N-(1-naphthyl)-N-phenylamino]-biphenyl («-NPD, 35 nm), 
5 +1wt% 4CzIPN or 5+ 1wt% 4CzTPN-Ph:CBP (15nm), TPBi 
(65nm), LiF (0.8nm), Al (70nm) and ITO (100nm), o«-NPD 
(40nm), 1,3-bis(9-carbazolyl)benzene (mCP, 10nm), 5+ 1wt% 
2CzPN:PPT (20 nm), PPT (40 nm), LiF and Al. For the green OLED, a 
very high external electroluminescence quantum efficiency, of 
19.3 + 1.5%, was achieved, which is equivalent to an internal electro- 
luminescence quantum efficiency of 64.3-96.5% assuming a light out- 
coupling efficiency of 20-30% (refs 25, 26). The orange and sky-blue 
OLEDs had external electroluminescence quantum efficiencies of 
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Figure 5 | Performance of OLEDs containing CDCB derivatives. External 
electroluminescence quantum efficiency as a function of current density for 
OLEDs containing 4CzIPN (green circles; error within 1.5%), 4CzTPN-Ph (red 
triangles; error within 1.0%) and 2CzPN (blue triangles; error within 1.0%) as 
emitters. Inset, electroluminescence spectra of the same OLEDs (coloured 
accordingly) at a current density of 10mAcm 7. 


11.2 + 1% and 8.0 + 1%, respectively, which are also higher than those 
of conventional fluorescence-based OLEDs. 

Finally, we consider the mechanism that drives such efficient reverse 
ISC without heavy metals. It is generally accepted that the introduction 
of spin-orbit coupling provided by heavy atoms is required for both 
ISC and reverse ISC to be efficient. Thus, metal complexes containing 
heavy metals and aromatic compounds with halogens and carbonyls 
can promote efficient spin conversion. However, our novel molecular 
design produces highly efficient spin conversion without needing 
such atoms. This is because the first-order mixing coefficient between 
singlet and triplet states (A) is inversely proportional to AEcy (ref. 27): 


Ho 
2 
AEs: (2) 


Here Hgo is the spin-orbit interaction. It follows from equation (2) 
that heavy atoms are not required to achieve efficient spin conversion 
when a molecule possesses a small AEs; and Hgo is not vanishingly 
small. This broadens the scope for the molecular design of TADF 
materials. A detailed strategy for large / will be clarified on the basis 
of correlated quantum-chemical calculations”. 


A 


METHODS SUMMARY 

Synthesis of carbazolyl dicyanobenzene. The synthesis of CDCB derivatives was 
performed according to the synthetic method reported previously for pyrrole and 
fluorobenzene derivatives’. CDCBs were synthesized by reaction of a carbazolyl 
anion with a fluorinated dicyanobenzene at room temperature (300 K) for 10h 
under a nitrogen atmosphere. CDCBs were purified by column chromatography 
on silica gel, by reprecipitation or both. CDCBs were further purified by sublima- 
tion before photoluminescence and electroluminescence spectra were measured. 
Photoluminescence measurements. Organic films for optical measurements 
were fabricated by thermal evaporation onto clean quartz and silicon substrates 
under high vacuum (~7 X 10-*Pa). The photoluminescence spectra of these 
films were recorded with a spectrofluorometer (FluoroMax-4, Horiba Jobin 
Yvon), and the photoluminescence quantum efficiencies were measured using 
an absolute photoluminescence quantum yield measurement system (C9920-02, 
Hamamatsu Photonics). The transient photoluminescence characteristics were 
measured (in terms of photon number) under vacuum using a streak camera 
(C4334, Hamamatsu Photonics). A nitrogen gas laser with a wavelength of 
337 nm and a pulse width of approximately 500 ps (MNL200, Lasertechnik) was 
used as an excitation source. Low-temperature measurements were made using a 
cryostat (CRT-006-2000, Iwatani Industrial Gases). InGa alloy was applied as an 
adhesive to ensure good thermal conductivity between the silicon substrate and 
sample holder. 
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OLED fabrication and electroluminescence measurements. The OLED devices 
were fabricated by thermal evaporation onto clean ITO-coated glass substrates 
under high vacuum (~7 X 10 *Pa). Current density/voltage/luminance (J- V-L) 
characteristics were measured using a semiconductor parameter analyser (E5273A, 
Agilent) with an optical power meter (1930C, Newport). Electroluminescence 
spectra of OLEDs were measured (in terms of photon number) using a calibrated 
multichannel spectrometer (PMA12, Hamamatsu Photonics). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Synthesis of carbazolyl dicyanobenzene. NMR spectra were obtained with a 
Bruker Biospin Avance-III 500 NMR spectrometer at ambient temperature. 
Because 4CzTPN, 4CzTPN-Me and 4CzTPN-Ph were not sufficiently soluble in 
common deuterated solvents, their '*C NMR spectra were not obtained. Infrared 
spectra were measured with a JASCO FT/IR-620 infrared spectrophotometer. 
High-resolution mass spectrometry (HRMS) by fast atom bombardment was done 
using a JEOL JMS-700 spectrometer. Matrix-assisted laser desorption/ionization- 
time of flight (MALDI-TOF) mass spectrometry was done using a BRUKER 
Autoflex-III. Elemental analysis was performed with a Yanaco MT-5 elemental 
analyser. All solvents and chemicals were reagent grade, obtained commercially 
and used without further purification except as noted. Dry tetrahydrofuran (THF) 
was purchased from Wako Pure Chemical Industries, Ltd. The synthesis of CDCB 
derivatives was performed according to the synthetic method reported previously 
for pyrrole and fluorobenzene derivatives”. CDCBs were purified by sublima- 
tion before photoluminescence and electroluminescence spectra were measured. 

We used the following general procedure for the synthesis of tetracarbazolyl 
dicyanobenzenes. Sixty per cent NaH in oil (0.48 g, 12 mmol) was washed with 
hexane and then added to a stirred solution of carbazole derivative (10.0 mmol) in 
dry THF (40 ml) under a nitrogen atmosphere at room temperature. After stirring 
for 30 min, tetrafluorophthalonitrile, tetrafluoroisophthalonitrile, or tetrafluoroter- 
ephthalonitrile (0.400 g, 2.00 mmol) was added. The reaction mixture was stirred at 
room temperature for 10h. The reaction was quenched with water (5 ml). The 
mixture was concentrated under reduced pressure to yield the crude product. 

We synthesized 1,2,3,4-tetrakis(carbazol-9-yl)-5,6-dicyanobenzene (4CzPN) 
according to the general procedure. The crude product was added to water, soni- 
cated for 10 min and then filtered. The residue was purified by column chromato- 
graphy on silica gel (CHCl3:hexane, 2:1 v/v and then CHCl3:acetone, 2:1 v/v) and 
then reprecipitated from CHCl;/MeOH to produce 4CzPN as an orange powder 
(0.602 g, 0.764 mmol, 38%). 

The results of our analyses are as follows. Thin-layer chromatography (TLC) 
(CHCl,:hexane, 4:1 v/v): retardation factor (R;), 0.20. 'H NMR (500 MHz, 
DMSO-d,): 5 (p.p.m.) = 7.90 (m, 4H), 7.73 (m, 4H), 7.41 (m, 8H), 7.17-7.12 
(m, 8H), 6.76 (m, 4H), 6.62 (m, 4H). °C NMR (125MHz, acetone-d,): 
6 (p.p.m.) = 141.58, 141.05, 138.75, 137.52, 125.13, 124.03, 123.04, 122.79, 
121.06, 120.46, 120.10, 119.29, 119.01, 113.41, 111.17, 111.06. Infrared spectro- 
scopy (KBr): absorption wavelength cm '), 2,228, 1,490, 1,457, 1,446, 1,310, 
1,223. HRMS (m/z): [M+H]* calculated for CsgH33N,, 789.2766; found, 
789.2767; analysis (calculated, found for Cs6H32Ng): C (85.26%, 85.16%), H 
(4.09%, 4.02%), N (10.65%, 10.55%). 

We synthesized 1,2,3,5-tetrakis(carbazol-9-yl)-4,6-dicyanobenzene (4CzIPN) 
according to the general procedure. The crude product was added to water, soni- 
cated for 10 min and then filtered. The residue was purified by column chromato- 
graphy on silica gel (hexane:CHCl, 5:1 v/v and then acetone:CHCls, 5:1 v/v) and 
reprecipitated from acetone/CHCI, to produce 4CzIPN asa yellow powder (1.41 g, 
1.79 mmol, 90%). 

The results of our analyses are as follows. TLC (acetone:CHCl;, 5:1 v/v): 
Rp = 0.25. 'H NMR (500 MHz, CDCI): 6 (p.p.m.) = 8.33 (d, J=7.5 Hz, 2H), 
8.06 (d, J = 8.0 Hz, 2H), 7.83 (dd, J = 8.0, 1.5 Hz, 4H), 7.71-7.67 (m, 6H), 7.47 
(m, 4H), 7.43 (d, J = 7.5 Hz, 2H), 7.14-7.08 (m, 8H), 6.82 (m, 2H), 6.71 (m, 2H). 
"°C NMR (125 MHz, acetone-dg): 6 (p.p.m.) = 146.88, 146.20, 141.31, 139.84, 
138.82, 137.50, 127.74, 126.41, 125.34, 125.32, 125.03, 124.56, 122.85, 122.32, 
121.80, 121.50, 120.98, 120.26, 118.58, 112.99, 111.79, 111.64, 111.36. Infrared 
spectroscopy (KBr): absorption wavelength (cm), 2,233, 1,545, 1,490, 1,479, 
1,458, 1,335, 1,309, 1,222. HRMS (m/z): [M+H]* calculated for CssH33No 
789.2766; found, 789.2781; analysis (calculated, found for Cs6H32Ng.): C 
(85.26%, 85.22%), H (4.09%, 4.03%), N (10.65%, 10.62%). 

We synthesized 1,2,4,5-tetrakis(carbazol-9-yl)-3,6-dicyanobenzene (4CzTPN) 
according to the general procedure. The crude product was added to water, soni- 
cated for 10 min and then filtered. The residue was purified by column chromato- 
graphy on silica gel (CHCl; and then acetone) and reprecipitated from acetone/ 
CHCI; to produce 4CzTPN as an orange powder (1.50 g, 1.90 mmol, 95%). 

The results of our analyses are as follows. TLC (acetone): Rp = 0.05. 'H NMR 
(500 MHz, DMSO-dg): 6 (p.p.m.) = 7.91 (m, 16H), 7.25 (m, 8H), 7.16 (m, 8H). 
Infrared spectroscopy (KBr): absorption wavelength (cm '), 2,235, 1,490, 1,480, 
1,455, 1,334, 1,224. HRMS (m/z): [M+H]” calculated for CssH33N¢, 789.2766; 
found, 789.2781; analysis (calculated, found for Csg5H3 Ng): C (85.26%, 85.28%), H 
(4.09%, 4.11%), N (10.65%, 10.61%). 

We synthesized 1,4-dicyano-2,3,5,6-tetrakis(3,6-dimethylcarbazol-9-yl)ben- 
zene (4CzTPN-Me) according to the general procedure. The crude product was 
added to MeOH (200 ml) and then filtered. The residue was washed with water 
and MeOH to produce 4CzTPN-Me as an orange powder (1.72 g, 1.91 mmol, 
95%). 


LETTER 


The results of our analyses are as follows. TLC (acetone): Rp = 0.05. 1H 
NMR (500 MHz, DMSO-dg): 6 (p.p.m.) = 7.81 (d, J = 8.5 Hz, 8H), 7.71 (s, 8H), 
7.11 (m, 8H), 2.37 (s, 24H). Infrared spectroscopy (KBr): absorption wavelength 
(cm '), 2,241, 1,461, 1,319, 1,281, 1,228. HRMS (m/z): [M+H]* calculated for 
Co4Ha9No, 901.4019; found, 901.4027; analysis (calculated, found for Cg4H4gNo): 
C (85.30%, 85.39%), H (5.37%, 5.36%), N (9.33%, 9.35%). 

We synthesized 1,4-dicyano-2,3,5,6-tetrakis(3,6-diphenylcarbazol-9-yl)ben- 
zene (4CzTPN-Ph) according to the general procedure. The crude product was 
added to water, sonicated for 10 min and then filtered. The residue was purified 
by column chromatography on silica gel (CHCl; and then acetone) and repre- 
cipitated from acetone/CHCl; to produce 4CzTPN-Ph as an orange powder 
(2.20 g, 1.57 mmol, 79%). 

The results of our analyses are as follows. TLC (acetone): Rp = 0.05. 'H NMR 
(500 MHz, DMSO-dg): 6 (p.p.m.) = 8.37 (d, J = 1.5 Hz, 8H), 8.05 (d, J = 8.5 Hz, 
8H), 7.70 (m, 16H), 7.62 (dd, J = 8.5, 1.5 Hz, 8H), 7.45 (m, 16H), 7.36 (m, 8H). 
Infrared spectroscopy (KBr): absorption wavelength (cm~'), 2,236, 2,228, 1,600, 
1,476, 1,456, 1,441, 1,290, 1,226. MALDI-TOF MS (m/z): [M]* calculated for 
CiosHeaNo, 1396.52; found, 1396.66; analysis (calculated, found for Cyo4H¢4Ne): 
C (89.37%, 89.26%), H (4.62%, 4.53%), N (6.01%, 5.95%). 

To synthesize 1,2-bis(carbazol-9-yl)-4,5-dicyanobenzene (2CzPN), 60% NaH 
in oil (0.96g, 24mmol) was washed with hexane and then added to a 
stirred solution of carbazole derivative (3.34 g, 20.0 mmol) in dry THF (80 ml) 
under a nitrogen atmosphere at room temperature. After stirring for 30 min, 4,5- 
difluorophthalonitrile (1.31 g, 8.00 mmol) was added. The reaction mixture was 
stirred at room temperature for 10h. The reaction was quenched with water 
(20 ml). The mixture was concentrated under reduced pressure. The residue was 
washed with water, purified by column chromatography on silica gel (CHCI;:hexane, 
4:1 v/v) and reprecipitated from acetone/hexane to produce 2CzPN as a pale yellow 
powder (0.310 g, 0.676 mmol, 8.5%). 

The results of our analyses are as follows. TLC (CHCls:hexane, 4:1 v/v): Rp = 0.23. 

'H NMR (500 MHz, acetone-dg): 5 (p.p.m.) = 8.73 (s, 2H), 7.79 (m, 4H), 7.38 (m, 
4H), 7.13-7.08 (m, 8H). °C NMR (125 MHz, acetone-dg): 6 (p.p.m.) = 139.86, 
139.57, 137.14, 126.76, 124.90, 121.99, 120.96, 116.39, 115.88, 110.74. Infrared spec- 
troscopy (KBr): absorption wavelength (cm), 2,236, 1,596, 1,507, 1,478, 1,446, 
1,335, 1,221. HRMS (m/z): [M]* calculated for C3,H,gN,, 458.1531; found, 
458.1529; analysis (calculated, found for C3,H;gN4): C (83.82%, 83.81%), H 
(3.96%, 3.80%), N (12.22%, 12.19%). 
Photoluminescence measurements. Organic films for optical measurements were 
fabricated by thermal evaporation onto clean quartz and silicon substrates under 
high vacuum (~7 X 10 *Pa). The photoluminescence spectra of these films were 
recorded with a spectrofluorometer (FluoroMax-4, Horiba Jobin Yvon), and the 
photoluminescence quantum efficiencies were measured using an absolute photo- 
luminescence quantum yield measurement system (C9920-02, Hamamatsu Photo- 
nics). The transient photoluminescence characteristics were measured (in terms 
of photon number) under vacuum using a streak camera (C4334, Hamamatsu 
Photonics). A nitrogen gas laser with a wavelength of 337 nm and a pulse width of 
approximately 500 ps (MNL200, Lasertechnik) was used as an excitation source. 
Low-temperature measurements were conducted using a cryostat (CRT-006-2000, 
Iwatani Industrial Gases). InGa alloy was applied as an adhesive to ensure good 
thermal conductivity between the silicon substrate and sample holder. 

Temperature-dependent photoluminescence characteristics of the prompt and 
delayed components for a 6 wt% 4CzIPN:CBP film were calculated as follows. 
Supplementary Fig. 2 shows the temperature-dependent lifetime of the prompt 
component for a 6 wt% 4CzIPN:CBP film. Assuming that the lifetime of singlet 
excited states is much longer than those of triplet excited states, the respective rate 
constants of the prompt and delayed fluorescence components (k, and kg), in the 
presence of mutual ISC between the singlet and triplet states, are given by“ 


kp = K+ +hisc 


kisc ) k 
RISC 
Ke +k, +kisc 
where k? and k®. are the radiative and non-radiative rate constants of the singlet 
state, respectively; kigc is the ISC rate constant from singlet to triplet states; and ky, 


is the non-radiative rate constant of the triplet state. Therefore, the photolumine- 
scence quantum efficiency of the prompt component is given by 


S 
ke S_S 
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kg=k+ (: 


where T° is the lifetime of the singlet state. By combining the absolute photolumi- 
nescence efficiency estimated using an integrated-sphere photoluminescence 
measurement system and the temperature dependence of the photoluminescence 
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decay curve at T = 300K, we estimated kS to be 1.27 X 10’ s”'. This allowed us to 
estimate ®, from the temperature-dependent lifetime of the prompt compo- 
nent, assuming that kS is independent of temperature. Using the total photolumi- 
nescence quantum efficiency estimated from the temperature dependence of the 
photoluminescence intensity, the photoluminescence quantum efficiency of the 
delayed component (®g) can also be estimated. Supplementary Table 4 shows 
the photoluminescence characteristics of a 6 wt% 4CzIPN:CBP film at each tem- 
perature. We estimated kgisc using these experimental data. 


OLED fabrication and electroluminescence measurements. The OLED devices 
were fabricated by thermal evaporation onto clean ITO-coated glass sub- 
strates under high vacuum (~7 X 10 *Pa). Current density/voltage/luminance 
(J-V-L) characteristics were measured using a semiconductor parameter ana- 
lyser (E5273A, Agilent) with an optical power meter (1930C, Newport). 
Electroluminescence spectra of OLEDs were measured (in terms of photon 
number) using a calibrated multichannel spectrometer (PMA12, Hamamatsu 
Photonics). 
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Increased future ice discharge from Antarctica owing 


to higher snowfall 


R. Winkelmann!?, A. Levermann'”, M. A. Martin’? & K. Frieler! 


Anthropogenic climate change is likely to cause continuing global 
sea level rise’, but some processes within the Earth system may 
mitigate the magnitude of the projected effect. Regional and global 
climate models simulate enhanced snowfall over Antarctica, which 
would provide a direct offset of the future contribution to global 
sea level rise from cryospheric mass loss”* and ocean expansion‘. 
Uncertainties exist in modelled snowfall’, but even larger uncer- 
tainties exist in the potential changes of dynamic ice discharge from 
Antarctica’® and thus in the ultimate fate of the precipitation- 
deposited ice mass. Here we show that snowfall and discharge are 
not independent, but that future ice discharge will increase by up to 
three times as a result of additional snowfall under global warming. 
Our results, based on an ice-sheet model’ forced by climate simula- 
tions through to the end of 2500 (ref. 8), show that the enhanced 
discharge effect exceeds the effect of surface warming as well as that 
of basal ice-shelf melting, and is due to the difference in surface 
elevation change caused by snowfall on grounded versus floating 
ice. Although different underlying forcings drive ice loss from basal 
melting versus increased snowfall, similar ice dynamical processes 
are nonetheless at work in both; therefore results are relatively 
independent of the specific representation of the transition zone. 
In an ensemble of simulations designed to capture ice-physics 
uncertainty, the additional dynamic ice loss along the coastline 
compensates between 30 and 65 per cent of the ice gain due to 
enhanced snowfall over the entire continent. This results in a 
dynamic ice loss of up to 1.25 metres in the year 2500 for the 
strongest warming scenario. The reported effect thus strongly 
counters a potential negative contribution to global sea level by 
the Antarctic Ice Sheet. 

During the past decade, the Antarctic Ice Sheet has lost volume at 
a rate comparable to that of Greenland®. The enhanced moisture- 
carrying capacity ofa warming atmosphere, on the other hand, strongly 
suggests increasing snowfall over Antarctica, as projected by global? 
and regional climate models’. This may lead to a net negative contri- 
bution of Antarctica to future sea level, depending on the magnitude of 
dynamic effects possibly compensating or even overcompensating this 
ice gain. Because surface melt will remain small even under strong 
climate change*’, the major uncertainty in sea-level projections from 
Antarctica arises from the unknown dynamic contribution. The main 
mechanisms for dynamic ice loss in future climate scenarios discussed 
so far are softening of the ice due to surface warming”, acceleration of 
ice streams through the reduction or loss of an adjacent ice-shelf*’ and 
an increase in driving stress at the grounding line due to enhanced 
basal melting at the underside of the shelves caused by a rise in ocean 
temperature’, all potentially resulting in higher ice flux across the 
grounding line (separating the grounded ice sheet from the floating 
ice shelves) (Fig. 1). 

In a perturbed ice-physics ensemble, we estimate a significant addi- 
tional effect of dynamic ice discharge, which is caused by the projected 
increase in snowfall itself. As detailed below, this effect makes the 
largest contribution to future ice loss, as projected by the conti- 
nental-scale Potsdam Parallel Ice-Sheet Model (PISM-PIK)’ with a 


consistent representation of the flow in the ice sheet, ice shelves and 
the transition zone relevant for the fast dynamics'*. The underlying 
mechanism is simple and very robust, and consequently the effect is 
found along almost the entire coastline of Antarctica (Supplementary 
Fig. 1). Itis due to a steepening of the surface gradient near the ground- 
ing line and the associated increase in the driving stress acting on the 
ice flow: additional snowfall increases the surface elevation of 
grounded ice as well as that of the floating ice shelf. Owing to the 
floating the resulting change in surface elevation is, however, ten times 
larger on grounded ice than on the floating shelf for the same amount 
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Figure 1 | Main mechanisms of dynamic ice loss. a, Conceptual figure 
illustrating the various forcings acting on the ice sheet. Inset, velocity field of a 
present-day Antarctic Ice Sheet simulation with PISM-PIK, which models the 
different flow regimes in sheet, streams and shelves in a consistent manner. The 
approach allows for the transition from vertical-shearing dominated flow of 
grounded ice to spreading-dominated flow in ice shelves. The resulting range in 
ice velocity spans several orders of magnitude and compares well with 
observational data’. Velocity profiles are illustrated as blue lines for the 
vertical-shearing-dominated and red lines for the spreading-dominated flow. 
b, Ice contours for the years 2000 (light grey) and 2500 (black) of the ECP 8.5 
scenario with enhanced snowfall (for the section depicted in Supplementary 
Fig. 1b). Enhanced snowfall leads to an increase in surface elevation on the 
sheet, whereas the elevation of the ice shelf remains almost constant because it 
floats. The corresponding driving stress with snowfall change (blue) is 
significantly higher than for the simulation with warming only (red). 
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of accumulated snow. In the simulations, this increase in surface gra- 
dient can be observed in all coastal regions of the Antarctic Ice Sheet; it 
is also observed when the initial states are forced by snowfall changes 
projected by regional climate models. It yields an increase in driving 
stress and hence ice transport across the grounding line (as shown for 
an exemplary profile of the coastal region of the Amundsen Sea sector 
in Fig. 1b). As a rule of thumb, the effect is larger than the increase in 
driving stress induced by enhanced sub-shelf melting as long as the 
rate of basal melt is less than nine times the snowfall, which is generally 
the case in all projections presented here (Fig. 2). Under very strong 
climatic forcing a few regions, such as the catchment area of the Ross 
Ice Shelf, show a retreat of the grounding line. Here additional snowfall 
may lead to an increase in the buttressing ability of the ice shelf, 
countering the effects of enhanced sub-shelf melting and limiting 
the acceleration of ice flux upstream of the grounding line. In all other 
regions and under less extreme climatic forcing, snowfall enhances 
ice loss. 

Although projections of Antarctic temperature and snowfall changes 
are subject to substantial uncertainty’, a robust feature of both global 
and regional model simulations that is relevant for the dynamic effect 
presented here is the increase in snowfall along the coastal margins’. 
The climate scenarios applied here are based on the Representative 
Concentration Pathways (RCPs), which span the full range of radiative 
forcing from 2.6 to 8.5 W m_” projected up until the end of the twenty- 
first century; after this, we used the respective Extended Concentration 
Pathways (ECPs)"* until 2500. Using a statistical downscaling method®, 
the corresponding global-mean temperature changes (as obtained 
from emulations of the full range of CMIP3 atmosphere-ocean general 
circulation model (AOGCM) responses’* using MAGICC 6.0'°) were 
transferred into scenarios for the surface temperature over Antarctica 
and the ocean temperature near the major ice shelves. The basal-melt 
rate underneath each of the shelves was computed from the shelf’s 
geometry and the temperature and salinity at its cavity entrance, using 
a simplified ocean model for the circulation in sub-ice-shelf cavities”. 
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Figure 2 | Comparison of drivers of dynamic ice loss. Given is the model 
spread (for the perturbed-physics ensemble) of the surface elevation changes at 
the grounding lines of the Ross (RIS), Filchner-Ronne (FRIS), Amery (AIS) and 
Larsen (LAR) Ice Shelves in the year 2100 compared to pre-industrial values. 
Whereas enhanced basal melting induces a surface elevation change of [1 — (p;/ 
Pw)]B at the grounding line (shown in red), the surface elevation change due to 
snowfall (shown in blue) is given by P,, — [1 — (pi/Pw)] Pa, where p;, and py are 
the density of ice and ocean water, respectively, B is the sub-shelf melt rate at the 
grounding line and P,, and Pg denote the snowfall rate upstream and 
downstream of the grounding line, respectively. Both the sub-shelf melt rate B 
and the snowfall rate P are given in metres per year. Snowfall-driven ice loss will 
thus dominate basal-melt driven ice loss if the snowfall rate at the grounding 
line is larger than approximately one-ninth of the basal melt rate at the 
grounding line. 
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Recently observed values for the Ross’®, Filchner’?-Ronne”’ and Amery”’ 
Ice Shelves as well as the shelf attached to Pine Island Glacier” 
are captured by the modelled range for the year 2000. Precipitation 
changes are parameterized following ref. 23 and account for tempera- 
ture increase as well as for changes in ice topography. As detailed else- 
where, we seek to capture the uncertainty that arises from different 
global climate sensitivity as well as the uncertainty due to ice-model 
parameters. To this end, each ECP scenario is represented by the median 
and the upper and lower 33rd percentile of the uncertainty distribution 
for the Antarctic temperature pathways originating from the inter- 
AOGCM variations and the uncertainty of the global mean temperature 
change from MAGICC 6.0. Furthermore, each of these 4 * 3 (scenarios 
X percentiles) simulations was carried out starting from an ensemble of 
81 initial states with different ice-parameter settings. All of these initial 
states closely resemble present-day Antarctica in ice volume and geo- 
metry, but differ in ice softness and the friction between ice sheet and 
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Figure 3 | Time series of snowfall-induced ice loss. a, Difference in solid ice 
loss between full scenario simulations and simulations with warming only, 
given in metres of sea-level equivalent. The additional ice loss due to enhanced 
snowfall increases for all scenarios with time. The spread from unknown 
physics captured in the 81-member ensemble is smaller than the range spanned 
by the medians for each scenario. b, Associated additional solid ice discharge, 
given in metres of sea-level equivalent per century. The model is in principle 
capable of reproducing the currently observed solid ice discharge from 
Antarctica (horizontal grey lines)° of 0.2-0.4 mm yr ' for the years 2000 to 
2009 and the associated acceleration within this time-period (thick black line; 
dashed lines indicate x-intercepts). These values are, however, reached later 
within the twenty-first century. As an example, the additional solid ice 
discharge is marked for one specific ensemble member by a thin black line, 
illustrating that the currently observed acceleration is within the variability of 
the additional ice discharge projected for ECP 8.5. The present study does not 
rely on the exact projection of ice loss from Antarctica but reports the relative 
importance of enhanced snowfall for the increase in future ice discharge 
compared to surface and ocean warming. 
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Figure 4 | Snowfall-induced ice loss compared to ice gain from precipitation 
and ice loss from warming only. All values are given in metres of sea-level 
equivalent. a, The additional ice loss under the ECP scenarios correlates with 
the cumulative amount of additional snowfall during the simulations. It 
compensates between 30% and 65% (shown as dashed black lines) of the ice 
gain due to enhanced snowfall. The percentage depends on the climate 
scenario—it is highest for the weak ECP 2.6 and lowest for the strong ECP 8.5. 


basal topography and therefore in their sensitivity to a change in the 
climatic boundary conditions. For details of the model and ensemble 
simulations, see Methods section and ref. 24. 

To estimate the influence of snowfall on ice discharge, we conducted 
simulations with surface and oceanic warming only as well as simula- 
tions with additional snowfall using the entire 81-member ensemble. 
Whereas the median of warming-induced ice loss in the year 2500 
ranges from 0.04 m sea-level equivalent for the ECP 2.6 scenario to 
0.65 m for the ECP 8.5 scenario, additional snowfall leads to a negative 
total mass balance in the range from —0.08 m for the ECP 2.6 scenario 
to -0.56 m sea-level equivalent for the ECP 8.5 scenario. These esti- 
mates of future mass loss are likely to be conservative for several 
reasons: additional mass loss might be caused by dynamic effects 
such as the marine ice sheet instability as well as the reduction of 
buttressing due to the retreat or collapse of surrounding ice shelves. 
It is also noteworthy that the projected snowfall increases represent a 
best-case scenario for mitigating Antarctica’s contribution to sea level 
rise under continued oceanic and atmospheric warming. In contrast, if 
snowfall trends remain insignificant, as they have during the past 
decades”*, the sea level contribution of the Antarctic Ice Sheet will 
be even larger. 

The amount of ice flowing across the grounding line in excess of the 
flux balancing the enhanced snowfall is called dynamic ice loss. For the 
entire ensemble, the difference in dynamic ice loss between the full 
scenarios and those with warming only—that is, the snowfall-induced 
ice discharge—increases throughout the simulations, amounting to 
0.07-0.16 m sea-level equivalent for the ECP 2.6 scenario, and 0.63- 
1.25m sea-level equivalent for the ECP 8.5 scenario with ensemble 
means of 0.12 m and 0.92 m, respectively, in the year 2500 (Fig. 3a). 
PISM-PIK is in principle capable of reproducing a dynamic situation 
where the ice discharge is of the magnitude and rate currently observed 
in satellite data°, though much later in this century (Fig. 3b). This may 
be because all scenarios start from an ensemble of equilibrium simula- 
tions that may not be the appropriate initialization for a future pro- 
jection. The results presented here do not rely on this initialization, 
however; they report the relative importance of snowfall compared to 
other currently discussed ice-loss processes. 

The snowfall-induced ice discharge correlates strongly with the 
cumulative amount of snowfall, compensating between 30% and 
65% of the additional ice volume from the projected increase in pre- 
cipitation (Fig. 4a). The ratio of this snowfall-induced ice discharge 
over cumulative snowfall changes predominantly with the warming 


0.5 


Ice loss caused by warming (m) 


For each scenario, the model was forced with the median as well as the upper 
and lower borders of the likely range of warming: that is, the 50th, the 66th and 
the 33rd percentile of the spread generated by climate projections. b, Dynamic 
ice loss through snowfall exceeds ice loss through warming. The ice loss from 
both snowfall and warming amounts to 200% to 300% of the ice loss through 
warming only. 


scenario, ranging, in the year 2500, from 35-50% for the high-emission 
scenario ECP 8.5, to 48-60% for the weaker ECP 2.6, in which 
global mean temperature is kept below two degrees with respect to 
pre-industrial levels'*. The effect is thus highest for low warming scen- 
arios, which might be due to the time-delay of the ice-sheet response to 
atmospheric forcing. The ice loss caused by the additional snowfall is 
larger than the ice loss from the combined warming of ice surface and 
the ocean. It amounts to 100%-200% of the volume change found in 
the warming-only simulations (Fig. 4b). Here the change in ice 
softness due to surface warming results in higher ice loss than the 
warming-induced basal melting, but both effects are significantly 
smaller than the ice loss caused by additional snowfall (Supplemen- 
tary Fig. 2). 

Our study thus shows that an increase in snowfall over Antarctica 
under global warming results in additional dynamic ice loss. For future 
climate scenarios based on the ECPs, this contribution is significantly 
larger than the dynamic ice loss due to warming only. This is consistent 
with the fact that the regional rate of snowfall increases the surface 
gradient across the grounding line nine times stronger than the equi- 
valent rate of basal melt. 


METHODS SUMMARY 


Simulations were carried out with the Potsdam Parallel Ice Sheet Model, PISM- 
PIK, as described in ref. 7, extended by an ocean box model” that simulates the 
main overturning circulation in sub-ice shelf cavities. Lack of knowledge about the 
values of the model input parameters introduces an uncertainty into the projec- 
tions of future ice discharge. To capture this uncertainty, we perturbed the para- 
meters that are most relevant for the ice flow across the grounding line. These are 
the two so-called enhancement factors (which determine the ice viscosity in the ice 
sheet, ice streams and ice shelves) and the pore-water fraction (which influences 
basal friction). For each parameter configuration, an equilibrium state of the 
Antarctic Ice Sheet under present-day boundary conditions”* was generated via 
the spin-up procedure described in ref. 27. We then excluded those initial states 
whose geometry deviates strongly from present-day observations. In particular, 
both the sea-level relevant volume of the West Antarctic Ice Sheet and the 
grounding line position may not differ by more than 10% from their observed 
values. The remaining 81 representations of the Antarctic Ice Sheet, which span 
the widest possible range of dynamic sensitivity compatible with these criteria, 
served as initial states for the simulations discussed here. They were forced by 
surface and ocean temperature trajectories based on the ECPs that were derived 
via a statistical downscaling method* from the corresponding global-mean- 
temperature changes. The dynamic response of the perturbed-physics ensemble 
to the ECP scenarios is further discussed in ref. 24. 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Perturbed-physics ensemble. Simulations were carried out with the Potsdam 
Parallel Ice Sheet Model, PISM-PIK, as described in ref. 7. Lack of knowledge 
about the values of the model input parameters introduces an uncertainty into the 
projections of future ice discharge. To capture this uncertainty, we perturbed the 
parameters that are most relevant for the ice flow across the grounding line. These 
are the two so-called enhancement factors (which determine the ice viscosity in the 
ice sheet, ice streams and ice shelves) and the pore-water fraction (which influ- 
ences basal friction). For each parameter configuration, an equilibrium state of the 
Antarctic Ice Sheet under present-day boundary conditions” was generated via 
the spin-up procedure described in ref. 27. We then excluded those initial states 
whose geometry deviates strongly from present-day observations. In particular, 
both the sea-level relevant volume of the West Antarctic Ice Sheet and the 
grounding-line position may not differ by more than 10% from their observed 
values. The remaining 81 representations of the Antarctic Ice Sheet, which span 
the widest possible range of dynamic sensitivity compatible with these criteria, 
served as initial states for the simulations discussed here. 

Climate scenarios. The climate scenarios for our simulations are based on the 
ECPs'*, which are the extensions of the RCPs**”*. Global mean temperature 
trajectories were obtained from emulations of the full range of CMIP3 AOGCM 
responses'* using MAGICC 6.0'°. The global mean temperature shows a close 
linear relation to the surface temperature over Antarctica as well as the ocean 
temperature near the major Antarctic ice shelves. Using the method described in 
ref. 8, the scaling coefficients were derived and then used to compute the uncer- 
tainty distributions for the surface and ocean temperatures under the ECPs. In our 
simulations of future solid ice discharge, we combined climate and ice-model 
uncertainty by forcing each of the 81 initial states from the perturbed-physics 
ensemble with the 33rd, 50th and 66th percentile of the temperature uncertainty 
distributions. Our results are thus based on a set of 972 simulations. To separate 
the effect of snowfall on solid ice discharge from the effects of surface warming and 
enhanced sub-shelf melting, the whole set of simulations was performed for the 
cases of surface warming only, surface and ocean warming, and surface and ocean 


LETTER 


warming including snowfall increase. The dynamic response of the perturbed- 
physics ensemble to the climate scenarios is further discussed in ref. 24. 
Sub-shelf melting. To simulate the main overturning circulation in the sub-ice 
shelf cavities, which greatly influences the local distribution of basal melting and 
refreezing, an ocean box model’ was used. The ocean box model was initialized 
with recently observed ocean temperatures and salinities!” near the Ross, Filchner- 
Ronne and Amery Ice Shelves as well as the shelf attached to Pine Island Glacier, 
from which we subtracted anomalies of the observational period to 1850 (the first 
year of the simulations). Sub-shelf melting was then computed from the ocean 
temperature and salinity provided by the ocean box model as well as changes in ice 
thickness and thus pressure melting temperature at the underside of the ice 
shelves. This approach enables the model to capture the wide spread of observed 
melt rates underneath Antarctic ice shelves'*. Modelled basal melt rates peak near 
the grounding line, as suggested by observations”. 

Blowing snow sublimation and redistribution. Blowing snow sublimation and 
redistribution might affect the results because it mainly occurs along the coastal 
margins of Antarctica*', where the impact of enhanced snowfall on the ice 
dynamics is greatest. The increase of relative humidity in the lower atmosphere 
due to blowing snow leads to an increase in snowfall, especially in the coastal areas 
where the air is close to saturation’’. This suggests that the impact on the ice 
dynamics presented here would be enhanced by blowing snow. However, this is 
likely to be a higher-order effect because the temporal trends in coastal blowing 
snow accumulation will be negligible compared to the overall increase in snowfall. 
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Deep penetration of molten iron into the mantle 
caused by a morphological instability 


Kazuhiko Otsuka! & Shun-ichiro Karato! 


The core-mantle boundary of Earth is a region where iron-rich 
liquids interact with oxides and silicates in the mantle’. Iron en- 
richment may occur at the bottom of the mantle, leading to low 
seismic-wave velocities and high electrical conductivity”, but 
plausible physical processes of iron enrichment have not been sug- 
gested. Diffusion-controlled iron enrichment is inefficient because 
it is too slow’, although the diffusion can be fast enough along 
grain boundaries for some elements’. More fundamentally, experi- 
mental studies and geophysical observations show that the core is 
under-saturated with oxygen, implying that the mantle next to the 
core should be depleted in FeO. Here we show that (Mg,Fe)O in 
contact with iron-rich liquids leads to a morphological instability, 
causing blobs of iron-rich liquid to penetrate the oxide. This mor- 
phological instability is generated by the chemical potential gra- 
dient between two materials when they are not in bulk chemical 
equilibrium, and should be a common process in Earth’s interior. 
Iron-rich melt could be transported 50 to 100 kilometres away 
from the core-mantle boundary by this mechanism, providing 
an explanation for the iron-rich regions in the mantle. 

Like the lithosphere near Earth’s surface, the core-mantle boundary 
region (CMB) of Earth (and other planets) is a region where major 
physical and chemical actions occur’. Some seismological observations 
suggest the presence of chemical heterogeneity***. Also, anomalously 
high conductance in some regions of the D’’ layer is inferred from the 
observed length-of-day variation””®. The regional variations in elec- 
trical conductivity observed by geomagnetic jerks (sudden changes in 
Earth’s magnetic field) correspond to regions of anomalous seismic 
properties”. Iron enrichment in some regions of the D’’ layer is a 
plausible explanation of these observations. 

However, explaining the iron enrichment is difficult for several reasons. 
In addition to the difficulties with diffusion-controlled models (because 
diffusion is so slow), the capillary mechanism of infiltration applies only to 
a penetration of about 100 m into the mantle or less, owing to the influence 
of gravity’. Iron-rich core materials may infiltrate into the mantle along 
the pressure gradient caused by the dynamic topography at the CMB”. 
However, the degree of melt penetration by this mechanism is controlled 
by the amplitude of dynamic topography, which is in turn controlled by 
the viscosity of the D’’ layer. If we use the viscosity of about 10’* Pas for 
the D’’ layer’, the extent of iron penetration for this mechanism is less 
than a metre, too small to cause any appreciable effects. 

Here we report the experimental observations of penetration of 
iron-rich metallic liquid blobs into the single crystals of (Mg,Fe)O 
through the morphological instability. The penetration depth of 
iron-rich blobs observed in these experiments far exceeds the penetra- 
tion depth attributable to a simple diffusion-controlled model or the 
other mechanisms discussed above (the capillary mechanism and 
dynamic topography). Hence, this morphological-instability process 
is more likely to explain the iron enrichment inferred in some regions 
of the CMB. We describe our experimental observations, interpret the 
observations in terms of a physical model, and discuss the possible 
implications of the deep penetration of molten iron-rich blobs for the 
properties and dynamics of the D’’ layer. 


We conducted high-pressure and high-temperature experiments 
using a multi-anvil apparatus in which iron-rich liquid and single 
crystals of (Mg,Fe)O are in direct contact (see Supplementary Inform- 
ation for details). After annealing, we found that each interface 
between an (Mg,Fe)O crystal and the molten iron was serrated and 
the liquid metal had penetrated into the (Mg,Fe)O single crystal to 
form a layer containing many metal-rich blobs (we call this the metal- 
rich layer, or MRL) (Fig. 1a). The serrated morphology of the metallic 
liquid and the (Mg,Fe)O interface is characteristic of the morpho- 
logical instability. Notably, the morphological instability was not 
observed at the wall of pure MgO single crystals, indicating that the 
FeO in (Mg,Fe)O plays an essential part in this. 

The chemical composition of matrix (Mg,Fe)O changes gradually 
over the region of liquid penetration (MRL) such that FeO content 
increases with the distance from the interface (see Fig. 1b). This means 
that the chemical equilibrium is established only near the interface, 
and the bulk of the crystals is out of equilibrium. 

The characteristic wavelength of the interface morphology mea- 
sured along the interface is approximately 1-10 um (Fig. 2). The 
least-squares fit with 2=k,L™ (where / is the instability wave- 
length, L is the thickness of the MRL and k,, and m are constants) 
gives m=0.5+0.2 for Fe-(Mgo.45Fep.s5)O and 0.3+0.3 for Fe- 
(Mgo,75Feo 25)O. 

The thickness of the MRL increases with annealing time, whereas 
their migration velocity decreases with time (Supplementary Infor- 
mation). The least-squares fit with L=k,t” (where t is duration of 
experiments and k,, and n are constants) gives n = 0.5 + 0.2. Such a 
relationship is consistent with a model of MRL growth (see below). 
Consequently, we may define an effective diffusion coefficient corres- 
ponding to the extent of melt migration. The effective diffusivity Dur 
was calculated using the equation Durr, = L’/t. Importantly, transcry- 
stalline melt migration is a much more efficient mechanism of chemi- 
cal transport than the conventional Fe-Mg interdiffusion’* under most 
of the conditions investigated. 

A morphological instability similar to the one observed in this study 
has been documented in many systems where two materials sharing 
one component (FeO in our case) are in contact but chemical equili- 
brium is attained only near the interface’*. The concentration gradient 
of the common component provides the driving force for this insta- 
bility. The characteristic wavelength at which the instability grows 


fastest is given by'®: 
A= 2n/30C3,/G (1) 


where J’ =yQ/RT is the capillarity length, Cc is the equilibrium FeO 
concentration in solid (Mg,Fe)O and G is the gradient of the FeO 
concentration (where y is the liquid-solid interfacial energy, Q is liquid 
molar volume and R is the gas constant). Also, the timescale on which 
the instability grows is given by t~A*/4nD, where D is the Fe-Mg 
interdiffusion coefficient’’. 

The elongated fingers formed at the interface are eventually pinched 
off (as a result of the surface tension) to produce isolated melt inclusions 
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Figure 1 | Distribution of Mg and Fe in the annealed couples of molten Fe 
and solid (Mg,Fe)O. a, Mapping of Mg concentration (left panels) and Fe 
concentration (right panels) in molten Fe and solid (Mgo 45Feo.55)O annealed at 
2,123 K for 60s (bottom panels, run number K1234), and in molten Fe and 
solid (Mgo 4sFeo 55)O annealed at 2,123 K for 300s (top panels, run number 
K1237). In left panels, red corresponds to high Mg concentration and green 
corresponds to low Mg concentration. In right panels, red corresponds to high 
Fe concentration and blue corresponds to low Fe concentration. b, Typical iron 
contents—FeO/(FeO + MgO) in molar ratios—in (Mg,Fe)O over the region of 
transcrystalline migration of iron-rich liquid inclusions as a function of 
distance from the melt and solid interface. 


(blobs) in the (Mg,Fe)O matrix. The isolated blobs will consume FeO 
from (Mg,Fe)O at the leading edge, while excess FeO will be precipitated 
at the another edge. In this way, blobs migrate while they re-establish 
the gradient of FeO that was initially defined by the degree of disequi- 
librium and the diffusion coefficients of the relevant species. The ana- 
lysis summarized in the Supplementary Information shows that the 
gradient in FeO concentration is approximately given by: 


de AC 

dx LL 

where Cis the concentration of FeO, x is the distance measured from the 
interface, CS, is the FeO concentration in (Mg,Fe)O far from the inter- 
face, and se is the FeO concentration in (Mg,Fe)O that will be in equi- 
librium with the molten iron. Consequently, the growth of the MRL 
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Figure 2 | Instability wavelength 4 plotted against thickness of chemical 
boundary layer L. Squares, Fe-(Mgp_.4;Feo 55)O in MgO capsule; diamonds, Fe- 
(Mgo.7sFeo.25)O in MgO capsule; and circle, MoO3-(Mgo.75Feo.25)O in Pt/Mo 
double capsule. Symbol colour indicates experimental temperature, for 
example from 1,973 K (dark blue) to 2,473 K (red). Arrows pointing right 
indicate that only the minimum values of the thickness of the chemical 
boundary layer are constrained. The stability limit of the interface is given here 
by 4 = L. In the unstable regime, the relation between instability wavelength 
and the thickness of the chemical boundary layer was estimated using 
A=2n\/3LT /¢( = Ams), where Ams is the length of the Mullins-Sekerka 
instability at temperature 2,273 K and pressure 12.5 GPa with the range of y/¢ 
values from 0.01 to 100. Error bars (both horizontal and vertical) indicate one 
standard deviation. 


follows dL/dt=AAC/L, where A is a constant that depends on the 
mechanism of melt blob migration. Therefore the growth of the MRL is 
described by: 


L=VAACt= V/Dypyt (2) 


where Dug, (= AAC) is the effective diffusion coefficient. 

Our experimental observations also show that Dygr is strongly 
dependent on temperature, and therefore we use the relation 
Dyirt = AoAC exp (— E*/RT) to analyse the data (Fig. 3). We found 
that the activation energy E* is significantly smaller than that for the 
interdiffusion coefficient'*’”'* in (Mg,Fe)O, suggesting that processes 
other than diffusion control the rate of liquid blob migration. One 
possibility is control by dissolution-precipitation. 

The outer core is predominantly composed of iron-rich molten 
metal with minor amounts of light elements, including oxygen and 
silicon’’. The inferred amount of minor light elements such as oxygen 
in the outer core is significantly smaller than their solubility in the 
outer core at the present-day CMB in equilibrium with the normal 
mantle composition’. The degree of supersaturation for (Mg,Fe)O at 
the base of the mantle is estimated to be ¢ ~ 10. Using a plausible value 
(of the order of 1Jm *) of the interfacial energy, the morphological 
instability probably occurs at the CMB with a characteristic wave- 
length of a few centimetres. 

To apply our results to the CMB, we need to evaluate the influence 
of temperature and pressure on the kinetics of MRL growth, because 
our experiments were conducted at a pressure significantly lower than 
the CMB pressure”! (around 135 GPa) and at lower temperatures than 
the temperature at the CMB” (~3,000-4,000 K). We use homologous 
temperature scaling: Duri(T,P)=Duri(T/Tm(P)), where Tmn(P) is 
the pressure-dependent melting temperature. For a reasonable value 

C.-C 

Ce, 
ness of the MRL is about 50 km after a billion years, and 100 km after 
4 billion years. We conclude that melt penetration by the morphological 


of supersaturation, ¢ | = = 10, we estimate that the thick- 
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instability in (Mg,Fe)O is capable of transporting iron-rich core com- 
ponents over tens of kilometres at the base of the lower mantle, affecting 
the physical and chemical properties of this region. 

The upward migration of iron-rich liquid has implications for seve- 
ral aspects of core-mantle interactions. The penetration of iron-rich 
blobs to tens of kilometres from the CMB can easily explain large 
velocity reductions and the high electrical conductance of some re- 
gions in the D’’ layer. However, the amplitude of velocity anomalies 
(~2-3%) and the depth extent of the large low-velocity province’ 
(~200-300km) may not be consistent with the iron penetration 
model: the amplitude of velocity reduction is too small and the thick- 
ness is too large. Iron penetration may affect physical properties in 
more localized regions such as a thin layer at the bottom of the large 
low-velocity province where the concentration of (Mg,Fe)O is large 
and in small regions with unusually low velocities (see Fig. 4). 

Finally, the liquid iron penetrating into the mantle may account for 
the isotope signatures of the core materials observed in some of the 
ocean islands**. However, processes that could produce such a sig- 
nature in volcanic rocks erupted at the surface are not well understood. 
The average density of an iron-rich region is too large for large-scale 
entrainment to occur by mantle convection*. Some small-scale pro- 
cesses, such as flow-induced material segregation”, would be needed 
to carry core-affected materials to the surface by a plume. Alterna- 
tively, some of the heterogeneities in the D’’ region could be attributed 
to the remnant of the magma ocean” and the pile-up of subducted 
materials”. 

Some issues remain uncertain and hence require further study. First, 
in this work we studied coupled iron-rich melt and (Mg,Fe)O. At 
Earth’s CMB, not only (Mg,Fe)O but silicate perovskite or silicate 
post-perovskite also occur. The penetration depth and the volume 
fraction of metallic blobs in perovskite (and post-perovskite) will be 
different in these minerals because of the difference in diffusion coef- 
ficient of Fe and the properties that control the blob migration (that is, 
dissolution-precipitation kinetics). Because iron diffusion in perovs- 
kite is much slower than in (Mg,Fe)O (ref. 17), either the density of 
blobs is lower or the MRL is thinner in perovskite than in (Mg,Fe)O. 
Consequently, the extent to which iron enrichment occurs at the CMB 
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Figure 3 | Effective diffusivity Dr, corresponding to the migration of 
iron-rich liquid blobs compared with the Fe-Mg interdiffusion coefficient 
in (Mg,Fe)O. Red squares, Fe-(Mgpo 45Feo.s5)O in MgO capsule; blue 
diamonds, Fe-(Mgp.75Feo.25)O in MgO capsule; and green circle, MoO>- 
(Mgp.75Feo 25)O in Pt/Mo double capsule. Solid lines indicate the Arrhenius 
equation Dur = (CS, — Ce,)A0 exp|—(EmgXmg + EreXre)/RT] where E is 
activation energy, and X is the mole fraction, fitted to the data (colours as for 
data points). Dashed black lines indicate the Fe-Mg interdiffusion coefficients'* 
for (Mgo.45Feo.55)O (top line) and (Mgy 75Feg.25)O (bottom line). Vertical error 
bars indicate one standard deviation. Horizontal error bars indicate the 
uncertainties in temperature caused by the temperature gradient or by the 
thermocouple failure (in the latter case, temperature was estimated using the 
power-temperature calibration). 
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is probably controlled by the volume fraction of (Mg,Fe)O. When the 
volume fraction of (Mg,Fe)O exceeds the percolation threshold (about 
20%), then substantial penetration of molten iron will occur. 
Consequently, regional variation in the volume fraction of (Mg,Fe)O 
will control the extent to which iron enrichment occurs at the CMB. 

Further studies are needed to investigate the kinetics of metallic 
liquid penetration in perovskite (and post-perovskite). The CMB in 
other planets, such as Mercury, is made of olivine and other low- 
pressure minerals. A dense layer is inferred to exist in the deep mantle 
of Mercury” that might be caused by the penetration of iron-rich 
materials into the mantle. Similar studies of olivine and other low- 
pressure minerals are also important. 

Second, the influence of grain boundaries on liquid blob penetration 
is unknown. Although iron-rich melt does not completely wet the grain 
boundaries of silicate minerals in most cases*’, complete wetting may 
occur in (Mg,Fe)O. Gravity will affect the nature of metal penetration if 
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Figure 4 | Influence of liquid iron blobs on geophysically observable 
properties. a, Estimated shear-wave velocity of the silicate/oxide mantle 
containing the liquid iron. The solid lines are calculated using the oblate 
spheroid model with aspect ratio ~. The shaded region is the range of observed 
velocities in unusually low-velocity zones. The plausible volume fraction of 
molten iron is ~0.2-0.3 (see Fig. 1a), and with these values, a reduction of 
seismic wave velocity of ~ 10-20% is easily explained with an aspect ratio of 
0.2-0.5 (see Fig. 1a). b, Electrical conductivity of the silicate/oxide mantle 
containing the liquid iron. ¢, is the volume fraction of iron-rich melt 
corresponding to the percolation threshold (representative value is about 0.2). 
Omin ANd Cmax are the minimum and maximum conductivities corresponding 
to an assumed layer thickness L that explain the geophysical observations. 
Inferred ranges of electrical conductivity of a conductive layer at the base of the 
mantle depend on the thickness of the metal-rich layer, L. The light- and dark- 
shaded regions represent conductive layers of thickness 25 km and 2.5km 
thick, respectively. Again, geophysically inferred conductivity can be explained 
by the presence of molten iron with a volume fraction of 0.2-0.3. 
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the iron-rich melt completely wets the grain boundaries. Further stud- 
ies of the penetration of metallic liquid blobs into polycrystalline and 
multi-phase materials will help us to understand core-mantle chemical 
interactions. Finally, the degree of chemical disequilibrium at the CMB 
probably reflects the chemical history of the CMB and could be he- 
terogeneous, which would cause the observed lateral heterogeneity in 
the penetration of molten iron into the mantle. 


METHODS SUMMARY 


All experiments were performed using the Kawai-type multi-anvil press. The 
(Mg,Fe)O single crystals with a range of FeO content were synthesized at 1,873 K 
for around 300 h at 0.1 MPa using a high-temperature controlled-oxygen-fugacity 
furnace. Single crystals of (Mg,Fe)O were annealed together with molten iron at 
high pressures and temperatures for a range of time (2 to 3,600s). After each 
experiment, the sample was quenched by shutting off the furnace at a given pressure 
and the distribution of elements (Fe and Mg) in recovered samples was determined 
using a field-emission-gun electron probe microanalyser (JXA-8530F). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Starting materials consisted of single crystals of (Mg,Fe)O and reagent-grade Fe 
metal or MoO, powder. (Mg,Fe)O crystals were synthesized by annealing MgO 
crystals embedded in the mixtures of MgO and Fe,O; haematite powder in a gas- 
mixing furnace at temperature of 1,873 K and at an oxygen fugacity of 1 Pa for 
approximately 300 h as described in ref. 31. The chemical analysis of two sets of 
recovered samples by a field-emission-gun electron probe micro-analyser (JXA- 
8530F) showed that the molar Mg/(Mg + Fe) ratio ranged between 0.45 and 0.50 
and between 0.75 and 0.80 for each crystal and varied less than 1% over the sample 
size used for high-pressure, high-temperature experiments. We refer to those 
compositions as (Mgp 4sFeg 55)O and (Mgp.75Fep 25)O, respectively. Those crystals 
were drilled into cylindrical shapes with thicknesses from 0.5 to 0.8mm and 
diameters of 1.1 mm with the cylindrical axis oriented close to the [100] crystal- 
lographic direction. 

High-pressure and high-temperature experiments were performed using a 
1,000-ton Kawai-type multi-anvil apparatus. Tungsten carbide cubes with the 
truncation edge length of 8 mm or 11 mm were used as second-stage anvils. The 
octahedral edge-length of the Cr,03-doped MgO pressure medium was either 
14mm or 18mm for the 14/8 and 18/11 assemblies, respectively. The pressure- 
load relationship for these cell assemblies were calibrated as described in ref. 32. 
The cross-section of the cell assembly is shown in Supplementary Fig. 1. For the 
14/8 assembly, one single crystal of synthetic ferropericlase was packed with Fe 
powder in a sample capsule made of single crystals of MgO with inner diameter 
1.7 mm and outer diameter 2.6 mm. The sample capsule was directly inserted to a 
stepped LaCrO; furnace. For an 18/11 assembly, one (Mg,Fe)O crystal was packed 
with MoO, powder in an outer Pt and inner Mo double capsule with inner 
diameter 1.6 mm and outer diameter 2.0 mm. The double capsule was insulated 
from a stepped graphite furnace by a MgO cylinder. For both assemblies, the 
cylindrical axis of the sample capsule was vertically aligned with gravity to avoid 
gravitational rearrangement during annealing. Temperature was monitored with a 
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WosRes-W74Re2¢ thermocouple with a thermocouple junction placed in contact 
with one end of the sample capsule without correcting for the effect of pressure on 
electromotive force. The ceramic parts of the cell assemblies were fired at approxi- 
mately 1,000 K overnight before assemblage and kept in a vacuum oven at approxi- 
mately 400 K. 

In each experiment, the starting material was brought up to pressure by raising 
the load at room temperature. Subsequently, temperature was raised by applying 
current across the furnace at two different heating rates: first ata rate of 50 K min’! 
up to 1,873 K, close to the eutectic temperature in the Fe-FeO system***’, and later 
at a much higher rate, usually 100-200 K min ' (or even higher), up to the target 
temperature to minimize possible chemical reactions between (Mg,Fe)O and 
metallic iron during heating. For a 14/8 cell assembly, the resistance of the 
LaCrO; furnace occasionally dropped during heating at around 1,823-1,873 K 
owing to the leakage of molten Fe from the MgO capsule, which caused very rapid 
temperature ramp-up at rates of 400-1,000 K min“! in the voltage-controlled 
heating system. Experimental durations were typically of the order of seconds to 
hours depending on the speed of melt migration. The samples were then quenched 
isobarically by shutting off the heating power and subsequently decompressed at 
room temperature. Experimental conditions are typically pressure 12.5 GPa and 
temperature 1,973-2,373 K. 
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Nonlinear dendritic integration of sensory and motor 
input during an active sensing task 


Ning-long Xu', Mark T. Harnett!, Stephen R. Williams’, Daniel Huber'+, Daniel H. O’Connor}}, Karel Svoboda! & J effrey C. Magee! 


Active dendrites provide neurons with powerful processing capabi- 
lities. However, little is known about the role of neuronal dendrites 
in behaviourally related circuit computations. Here we report that a 
novel global dendritic nonlinearity is involved in the integration of 
sensory and motor information within layer 5 pyramidal neurons 
during an active sensing behaviour. Layer 5 pyramidal neurons possess 
elaborate dendritic arborizations that receive functionally distinct 
inputs, each targeted to spatially separate regions’. At the cellular 
level, coincident input from these segregated pathways initiates 
regenerative dendritic electrical events that produce bursts of action 
potential output** and circuits featuring this powerful dendritic 
nonlinearity can implement computations based on input correla- 
tion’. To examine this in vivo we recorded dendritic activity in layer 
5 pyramidal neurons in the barrel cortex using two-photon calcium 
imaging in mice performing an object-localization task. Large- 
amplitude, global calcium signals were observed throughout the apical 
tuft dendrites when active touch occurred at particular object loca- 
tions or whisker angles. Such global calcium signals are produced by 
dendritic plateau potentials that require both vibrissal sensory input 
and primary motor cortex activity. These data provide direct evidence 
of nonlinear dendritic processing of correlated sensory and motor 
information in the mammalian neocortex during active sensation. 

The unique integrative properties of cortical pyramidal neurons 
enable them to powerfully transform synaptic-input patterns delivered 
through structured network connectivity®. Of particular interest is the 
ability of pyramidal-neuron dendrites to actively integrate input from 
spatially segregated and functionally distinct pathways (for example, 
sensory or motor pathways) when strong temporal correlations exist 
between these representations*°. Circuit computations based on active 
dendritic transformation of different streams of information by pyra- 
midal neurons could underlie a variety of functions that include top- 
down cortical interactions, associative feature binding and predictive 
coding’"°. To investigate the role of dendritic integration in beha- 
viourally relevant network computations, we used two-photon micro- 
scopy to image dendritic Ca** signals from distal tuft branches of layer 
5 pyramidal neurons of the barrel cortex (S1) labelled with a genetically 
encoded calcium indicator (GCaMP3)"" through a chronic imaging 
window while mice performed a task based on vibrissal active touch 
(Fig. la and Methods). Head-fixed mice were trained to move either 
one row of whiskers or a single whisker to locate an object (Fig. 1a, d, 
Supplementary Fig. 1 and Methods)". In ‘go’ trials, a vertical pole was 
presented at one of four positions along the anterior—posterior axis and 
a correct response was defined as licking for water reward. In ‘no-go’ 
trials, the pole was located in a more anterior position and the correct 
response was to withhold licking. Trained mice touched the pole in the 
go positions but rarely in the no-go position. Experimental beha- 
vioural sessions began after an training period (signal detection stat- 
istic (d’) > 1.8; Supplementary Fig. 1 and Methods). Whiskers were 
tracked simultaneously using a high-speed imaging system to quantify 
whisker-related behavioural variables’? (Fig. 1a). 


During task performance large-amplitude Ca** transients were 
observed in apical tuft dendrites of layer 5 pyramidal neurons (~20- 
200 um below the pial surface; amplitude, 89.9 + 42.9% peak AF/F; 
duration, full width at half maximum = 0.48 + 0.31s; Fig. 1b, ¢, 
Supplementary Fig. 2 and Methods). The dendritic Ca*” signals were 
primarily observed during the time period when the target pole was 
present (sampling period; Fig. 1c, d), and less frequently following this 
period (post-sampling period; Supplementary Fig. 3 and Methods). 
Quantitative analysis of the whisker—object interactions showed that 
dendritic Ca** transients preferentially occurred in trials that con- 
tained whisker—object contacts (Fig. 1c—e), with short latency after first 
touch (40 + 38 ms, n = 57 regions of interest (ROIs) from 4 animals; 
Fig. 1d). Periods of isolated whisking (non-touch trials) were not asso- 
ciated with detectable tuft Ca** transients, even though whisking alone 
is capable of evoking action potentials in layer 5 pyramidal neurons'***. 
The touch dependence and short latency from object contact suggest a 
role for ascending sensory input in the generation of dendritic Ca" 
signals. However, passive whisker deflection in anaesthetized mice was 
ineffective in producing dendritic activation (Supplementary Fig. 4)’”. 
The above data suggest that active sensation, involving both whisking 
and object touch, is required for apical tuft dendrite activation. 

To examine the spatial extent of behaviourally evoked dendritic 
Ca’* signals, we imaged multiple apical dendritic branches of a single 
neuron simultaneously during behavioural sessions (sparse labelling 
condition, Fig. 2aand Methods). Large Ca** signals occurred synchro- 
nously in all of the imaged branches belonging to the same cell (Fig. 2b, c), 
such that strong correlations were observed among the branch Ca”* 
signals (Fig. 2d, f; R=0.69+0.13 (+s.d.), 1 =18 neurons; Sup- 
plementary Fig. 5). However, this was not the case for branches that 
belonged to different cells (Fig. 2d-f, R = 0.097 + 0.13, n = 30 imaging 
fields; Supplementary Fig. 5 and Methods). These results indicate that, 
in contrast to previous experiments using passive sensory stimulation 
in anaesthetized mice'’*° or freely whisking mice”, a salient beha- 
vioural event (that is, active touch) produces a strong activation of 
the entire apical tuft dendrite region. The widespread nature of the 
Ca’* signals also differs from that expected from more localized 
NMDA (N-methyl-p-aspartate) spikes”. 

To determine the electrophysiological basis of the observed tuft 
Ca’* signals, we performed in vivo targeted whole-cell recordings 
from apical tuft dendritic branches of layer 5 pyramidal neurons posi- 
tive for GCaMP3 in the barrel cortex of anaesthetized mice (Fig. 3a, b). 
We observed electrical events of moderate amplitude (~30mV) 
and relatively long durations (~20 ms) during periods of heightened 
network activity (‘unpaired events’, 0.26 + 0.13 Hz; Fig. 3c, d and 
Supplementary Fig. 6). These unpaired events were not associated with 
any detectable change in GCaMP3 fluorescence in dendritic regions 
near the recording pipette (Fig. 3c, d, f). However, prolonged depo- 
larization of the apical tuft dendrite (>15mV, 500 ms) by current 
injection converted these electrical events into large-amplitude 
and long-duration regenerative plateau potentials (‘paired events’, 


Howard Hughes Medical Institute, Janelia Farm Research Campus, Ashburn, Virginia 20147, USA. Queensland Brain Institute, The University of Queensland, St Lucia, Queensland 4072, Australia. 
+Present addresses: Department of Basic Neurosciences, University of Geneva, CH-1211 Geneva, Switzerland (D.H.); Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, 


Maryland 21205, USA (D.H.O). 


13 DECEMBER 2012 | VOL 492 | NATURE | 247 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 
rAAV-GCaMP3 


Two-photon 
microscope oe 


Imaging 
area 


b By um 


— 
'S 


Go position 


No-go position 


Head 


fixation 
Water a 


reward Whisker MANS 00: — 
- or VV Ve mm L200% 
a “High-speed videography Tee 50 ms I 0.58 


1 2 
t= 2.044 | t= 2.060 ‘ t= 2.068 | 


Pre touch Whisking + touch Post touch 
c d 
& 
[s} 
> 
2 
fe} 
Zz 
l 100% 
1s 
e 70 == Touch C2 
4 50 ™ Touch non-C2 
<i 30 == Non-touch 
* 410 
0 15 3 45 
Ss f Time from first touch onset (s) 
S 
e wo 150 
= P<3x 1076 
s n=57 
[o} 
3 c 100 
ce 
as 
a> 50 
a 
° 
ai + i ans Eanes 
o —— 
oO (0) 1 1 
0 50 100 150 
Ca?+-signal touch trials 
(% AF/F) 


Figure 1 | Active touch evoked large dendritic Ca”* signals in distal tuft 
branches of layer 5 pyramids. a, The experimental set-up; a head-fixed mouse 
performs a whisker-dependent object-localization task under a microscope. 
The scanning laser beam (red) is focused on distal dendrites of GCaMP3- 
labelled neurons (green) through an imaging window. The mouse actively 
whisks to find the pole and makes a lick response (go) or withholds licking (no- 
go). Whisker motion was recorded (bottom) and quantified (whisker angle, 0, 
and curvature change, k; grey shows touch). Top left, schematic showing two- 
photon imaging set-up. Top right, GCaMP3 is expressed in deep layers of barrel 
cortex. N.A., numerical aperture. b, Tuft branches (top and middle panels) and 
Ca’* signals (AF/F; bottom panel) from different sub-regions of a single branch 
(green dashed boxes). Middle panel is a magnified version of a region of the top 
panel (outlined by the green box). c, Colour raster of Ca”~ signals from all trials 
of a behavioural session sorted into touch (bottom panel) and non-touch trials 
(top panel). d, Touch trials from ¢ expanded as traces. Vertical bars in ¢ and 
d show sampling period. e, Averaged dendritic Ca’* signals aligned to the onset 
of touch from trials with C2 whisker touch (Touch C2), with touch from 
whiskers other than C2 (Touch non-C2) and with no touch (Non-touch). 

f, Population data of dendritic Ca”* signals (AF/F amplitude) during whisker 
sampling time, compared between touch trials and non-touch trials (n = 57 
dendritic branches, 4 animals). Data in e and f are mean + s.e.m. 


2.23 + 0.55 Hz; Supplementary Fig. 6) that were associated with large 
dendritic Ca”* signals (Fig. 3 and Supplementary Fig. 6). 

Dual dendritic whole-cell recordings from layer 5 pyramidal neurons 
in acute neocortical slices revealed that apical trunk Ca’* spikes 
evoked by local current injection decreased in amplitude as they spread 
back into the tuft dendrites, producing events that were very similar in 
amplitude, duration and shape to the unpaired events observed in vivo 
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Figure 2 | Global dendritic tuft Ca”* signals during active sensing. 

a, Simultaneous Ca”* imaging during behaviour from multiple tuft branches 
(1-5) of the same cell, identified using three-dimensional reconstruction of 
imaging stacks. b, Ten consecutive trials from the five ROIs indicated in 

a. Signals 0.5 s before to 0.5 s after the sampling period are shown. Vertical bars 
indicate trial types. c, Correlation matrix for the ROIs in a. d, Ca’ -signal 
correlation between branches for sparsely labelled imaging fields. ROIs from 
the same (magenta) and different (green) cells are labelled. e, Correlation 
similar to that in d, for a densely labelled imaging field. f, Histogram of 
population data showing distribution of pair-wise correlation for ROIs from 
multiple cells (R = 0.097 + 0.13) and for ROIs from single cells 

(R= 0.69 + 0.13). 


(Fig. 3g and Supplementary Figs 6 and 7). Such unpaired events were 
not associated with substantial GCaMP-mediated tuft Ca~* signals 
(Supplementary Fig. 8). Pairing of the back-spread apical trunk Ca7* 
spikes with simultaneous tuft dendrite depolarization also increased 
their amplitude and duration to a similar extent to the spikes observed 
in the in vivo recordings (paired events; Fig. 3g and Supplementary Figs 
6 and 7). Similar results were obtained when synaptic input was used to 
produce tuft depolarization (Supplementary Fig. 9). 

In addition to increasing the amplitude and duration of individual 
tuft plateau potentials, the local depolarization also produced multiple 
or repetitive events during the current-injection period” (mean plateau 
interval: 209 + 20 ms). Multiple plateau potentials further enhanced 
the amplitude and duration of the associated tuft Ca7* signals (Sup- 
plementary Fig. 6). The time course of the Ca** signals in the anaes- 
thetized mice covered a similar range to those observed during active 
sensation (Fig. 3f and Supplementary Fig. 6). Together, these data 
indicate that tuft dendritic Ca** signals recorded during active sen- 
sation were produced by single or repetitive tuft plateau potentials 
evoked through an interaction between apical dendrite Ca** spikes 
and coincident tuft dendrite depolarization. Importantly, this inter- 
action enhanced the frequency and duration of the apical trunk Ca** 
spikes (paired trunk events; Fig. 3g and Supplementary Figs 6, 8 and 9), 
providing a mechanism by which tuft depolarization can effectively 
influence action potential output. 

What might be the source of apical tuft depolarization during beha- 
vioural conditions? It is known that the vibrissal motor cortex (vM1) 
provides a major excitatory input to layer 1 of barrel cortex'*”*, and 
that vM1 neurons code for whisking parameters during active sensing 
behaviour, with this activity being fed back through vM1 axons to 
layer 1 of S1 (refs 25-27). We reasoned that the excitatory input from 
vM1 to the distal dendritic regions of $1 layer 5 pyramidal neurons 
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Figure 3 | Dendritic Ca”* signals are produced by tuft plateau potentials. 
a, Schematic of targeted dendritic patch-clamp recording from an 
anaesthetized mouse in vivo. Vm, membrane voltage. b, Side projection from a 
GCaMP3-expressing layer 5 pyramidal neuron. The pipette was filled with 
Alexa 594. c, Ca** signals (red) and simultaneous intracellular voltage 
recordings (black). The top records contain only spontaneous dendritic voltage 
events without dendritic current injection. The three bottom records include 
500-ms current injections. I inject, current injection. d, Detailed views of the 
voltage-recording traces in c (indicated by the dashed boxes). Top two traces 
show increased network activity that appears as excitatory postsynaptic 
potentials (EPSPs) and moderate unpaired events (asterisk). Bottom traces 


may interact with ascending sensory input during active touch to 
produce the observed global distal dendritic nonlinearity. To test this, 
we silenced vM1 using localized muscimol injection and carried out 
calcium imaging from the same dendritic branches across several days 
of behavioural sessions. Dendritic Ca** signals evoked by active touch 
during control sessions were largely abolished during sessions when 
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show that pairing of these spontaneous events (grey traces, unpaired) with tuft 
depolarization converts them into regenerative plateau potentials (black traces, 
unpaired). e, Tuft plateau potentials are highly correlated in time with Ca?* 
signals recorded nearby. f, Event-duration distributions for plateau-potential- 
evoked Ca** signals from anaesthetized mice (red) and for all Ca?* events from 
awake behaving mice (blue). Occurrence normalized to maximum. g, In vitro 
simultaneous dual whole-cell voltage recordings from layer 5 pyramidal neuron 
tuft and apical trunk dendrites show that unpaired trunk spikes weakly spread 
into the tuft (grey trace), whereas those paired with local tuft depolarization 
become regenerative plateau potentials with enhanced amplitudes and 
durations (black trace). 


vM1 was silenced, and recovered in subsequent recording sessions, 
approximately 24h after injection (Fig. 4a, b, f and Methods). To 
control for the specificity of vM1 silencing, we injected the same dose 
of muscimol to a control site that was the same distance from the 
imaging window as vM1 (~3.2 mm). Imaging from the same den- 
dritic branches showed that whisker touch-evoked dendritic Ca** 
signals were still present after muscimol injection to the control site 
(Fig. 4a, b, g, h). 

To determine the effect of VM1 silencing on whisker motion and the 
strength of sensory drive, we examined both whisking amplitude 
and the forces on the whiskers (total peak-curvature change (|Ak]); 
see Methods) for each experimental condition. We found that vM1 
silencing did not reduce the total |Ax| compared to the control con- 
dition (Fig. 4c), but it greatly diminished dendritic Ca’ signals for the 
entire range of sensory-input strength (Fig. 4e). Whisking amplitude 


Figure 4 | vM1 silencing abolished touch-evoked dendritic responses. 

a, Colour raster of Ca”” signals of touch trials from a dendritic branch imaged 
across behavioural sessions for four conditions. White vertical lines indicate the 
sampling time. b, Averaged Ca’” signals aligned to the onset of whisker touch 
from all touch trials under each of the four conditions. c, Cumulative 
distribution of total peak-curvature change (total A]1|) for all touch trials 
under each of the four conditions. d, Cumulative distribution of mean whisking 
amplitude during sampling time for all touch trials under each of the four 
conditions. e, Stimulus-response relationship under each of the four 
conditions. Mean AF/F during sampling time plotted as a function of total A| | 
from protraction touch. Open circles, data from each trial. Solid lines connect 
binned data points. f, Summary of population data comparing Ca’~ signals 
from the same dendritic branches during control sessions and sessions with 
muscimol injected into vM1 (n = 23 branches, 4 animals). g, Summary of 
population data comparing Ca?” signals from the same dendritic branches 
during control sessions and sessions with muscimol injected into a control site 
(n = 20 branches, 3 animals). f, Summary of population data of dendritic Cart 
signals under the four conditions. ***P <7 x 10”). Error bars, s.e.m. 
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during the sampling period was slightly reduced by muscimol injec- 
tion to either vM1 or the control site (Fig. 4d). However, as dendritic 
responses were only affected by injection to vM1, this non-specific 
reduction in whisking amplitude cannot account for the reduced 
dendritic responses. We also observed that muscimol injection to 
vM1, but not to the control site, largely eliminated dendritic responses 
to a second type of whisker stimulation in awake animals (50-ms air 
puff; Supplementary Figs 4 and 10). These results indicate that distal 
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Figure 5 | Whisker-angle dependence and object-location selectivity. 

a, Colour raster of dendritic Ca** signals from an example ROI. Trials are 
sorted according to object locations (indicated by red vertical bars, as shown in 
b). b, Mean AF/F averaged from trials with different object locations. Colours 
correspond to object locations (measured as the distance from the centre of the 
whisker pad along the anterior-posterior axis). c, Averaged Ca”~ signals (filled 
circles) and total touch induced |Ax] (open circles) plotted against object 
location (object locations as above). d, Stimulus-response relationship for 
dendritic Ca** signals at different object locations. Top row, projection of five 
frames of a tracked whisker showing whisker-object contact at different object 
locations (object locations as above). Whisker images were truncated beyond 
the object. Whisker set points at touch are indicated. Bottom row, AF/F during 
sampling time as a function of touch-induced total |Ax| for different object 
locations (whisker angles). Lines are linear regression fits. e, The slope of 
stimulus-response relation (from linear regression as in d) from trials binned 
by whisker set point during touch (bin size = 5°) for population data (n = 22 
branches from 5 animals with single whisker-object contacts). Whisker set- 
point bins for all dendritic branches were aligned with the highest slope value 
set to 0°. The slope value at 0° is significantly higher than that obtained in the 
same manner from randomly permuted data (P < 0.0001; see Methods). Open 
grey circles indicate data from single dendritic ROIs, filled black circles 
represent averaging across all ROIs for each set-point bin (mean + s.e.m). 
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tuft depolarization is required for the behaviourally evoked dendritic 
activity and that this depolarization is provided by vM1 input to L1. 
Thus, it seems that distal dendritic plateau potentials are responsible 
for producing a nonlinear dendritic integration of coordinated sen- 
sory and motor information in layer 5 pyramidal neurons during 
active sensing. 

Finally, we observed a pronounced dependence of the dendritic 
Ca’~ signal amplitude on the particular location of the whisker object 
touch (Fig. 5a, b). This location dependence does not seem to be due to 
differences in the mean forces generated by whisker touch at the dif- 
ferent object locations (Fig. 5c). Furthermore, the relationship between 
dendritic response and whisker touch was found to vary as a func- 
tion of the whisker angle during touch (set point: low-pass filtered 
whisker angle; Methods), even for similar touch force magnitude 
ranges (Fig. 5d). Grouped data from mice with a single whisker con- 
tacting the object at different locations revealed a marked tuning of 
the stimulus-response slope to particular whisker set-point locations 
(Fig. 5e). Thus, the variation in dendritic response amplitude at different 
object locations may be the result of a whisker angle set-point-dependent 
modulation of the stimulus—response slope or gain. Such an adjustable 
gain could be produced by whisker set-point-dependent variations in 
vM1 input to the tuft dendrites of layer 5 pyramidal neurons”°’’. These 
data suggest that a circuit-level computation is implemented in the 
distal dendrites of layer 5 pyramidal neurons through the integration 
of correlated sensory and motor information during active touch to 
produce an object-localization signal. 

Our current understanding of dendritic computation during beha- 
viour is limited. Recent recordings from populations of pyramidal- 
neuron dendrites suggest an active role for dendrites in information 
processing during the awake state*’”*. Here we report that novel long- 
duration plateau potentials and associated global dendritic Ca** 
signals (Figs 2 and 3) are produced in the apical dendrites of individual 
layer 5 pyramidal neurons when mice perform an active sensing task. 
The Ca* * -mediated dendritic plateau potentials were probably evoked 
by correlated perisomatic input from ascending sensory drive (Fig. 1) 
and layer 1 synaptic inputs from cortico-cortical feedback connec- 
tions'”® (Fig. 4). This active input processing in pyramidal-neuron 
dendrites seems to underlie a circuit computation for the tactile local- 
ization of salient objects (Fig. 5 and Supplementary Fig. 12). Thus, our 
results suggest that active nonlinear dendritic integration in layer 5 
pyramidal neurons has a central role in the production of a beha- 
viourally related computation in the barrel cortex. Because pyramidal 
neurons throughout the central nervous system (CNS) are capable of 
producing related forms of active dendritic integration’, the above 
correlation-based computations may be a common feature of many 
neuronal circuits’ 1°??”°. 


METHODS SUMMARY 


The GCaMP3 (ref. 11) was expressed using a virus (rAA V-syn-GCaMP3, serotype 
2/1, University of Pennsylvania Gene Therapy Program Vector Core), which was 
injected stereotactically into $1 (20 nl, 750 um below pia) of adult C57BL/6 male 
mice (older than postnatal day 60 (P60)) and a circular craniotomy (1.5-2 mm) 
was made over the left $1. Animals were trained in a head-fixed whisker-based 
object-localization task. Distal apical dendrites from deep layer neurons were 
imaged using a custom two-photon microscope during task performance. Images 
of the whiskers were acquired using a high-speed CMOS camera (EoSense CL, 
Mikrotron) at 500 frames per s. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Chronic imaging window. All procedures were in accordance with protocols 
approved by the Janelia Farm Research Campus Institutional Animal Care and 
Use Committee or Queensland Brain Institute Institutional Animal Care and Use 
Committee. All animals used in awake behaving experiments were adult male 
(older than P60) C57BL/6Crl mice (Charles River). During surgery, mice were 
anaesthetized with isoflurane (~2% by volume in O3; SurgiVet, Smiths Medical). 
A craniotomy (~2 mm in diameter) was made over the left barrel cortex. The dura 
was left intact. Virus-containing solution was slowly injected (20 nl per site, 3 to 4 
sites per animal; depth ~700 ,1m) into the cortex. The injection system was com- 
prised of a pulled glass pipette (broken and bevelled to 25-30 um (outside dia- 
meter); Drummond Scientific, Wiretrol II Capillary Microdispenser) back-filled 
with mineral oil. A fitted plunger was inserted into the pipette and advanced to 
displace the contents using a hydraulic manipulator (Narashige, MO-10). 
Retraction of the plunger was used to load the pipette with virus. The injection 
pipette was positioned with a Sutter MP-285 manipulator. After injection, the 
craniotomy was covered with a double-layered glass coverslip’**’, sealed in place 
with dental acrylic (Jet Repair Acrylic, Lang Dental Manufacturing). The double- 
layered glass was comprised of a 177-200-|1m-thick glass coverslip (diameter 
~1.5mm) attached to a larger glass coverslip (diameter ~5 mm) using ultraviolet 
cured optical adhesives (Norland Optical Adhesives 61). A titanium head-post 
with an opening on the left side was attached to the skull with cyanoacrylate glue 
and dental acrylic to permit head fixation and two-photon imaging over the cranial 
window. Mice were allowed at least 7 days to recover before water restriction. 
Head-fixed mouse behaviour. We designed a head-fixed active object-localization 
task with multiple target-object locations that require the mice to use active 
touch to determine the location of a pole, while providing a variety of stimuli 
to the whiskers (with different object locations). In each trial, the target object 
(a vertical pole) was presented randomly at either one of multiple ‘go’ positions 
or at a single ‘no-go’ position along the anterior—posterior axis on the right side 
of the mice (8-9 mm lateralized from the whisker pad). The go position was 
located within easy reach by whisking, and the location varied from trial to trial 
on the anterior—posterior axis, within a range of ~18 mm (~5 mm posterior to 
~13 mm anterior to the centre of the whisker pad). The no-go position was in an 
anterior position, but within reach of the whiskers. Mice were trained to report the 
detection of an object at one of the go positions by licking a water port, and to 
report an object at the no-go position by withholding licking. Trained mice 
typically did not touch the object in the anterior no-go location. The trial began 
with a delay of 0.5~1s before the pole started to ascend (time of ascent ~0.3 s) 
into the plane of the whiskers (Fig. 1a). The pole stayed in the whisker field for 
1.5~2s (the ‘sampling’ period) before starting to descend out of the whisker 
plane (time of descent ~0.5 s) followed by an ‘answer’ period (1.5~2s) during 
which the mouse responds by licking or withholding licking. Correct no-go res- 
ponses (‘correct rejections’) were not rewarded, and incorrect go responses 
(‘misses’) were not punished. Licks occurring within the answer period were 
recorded as go responses. Correct go responses (‘hits’) were rewarded with a drop 
of water (~8 ul). The trial was paused for 2s to give the mouse time to drink. 
Incorrect no-go responses (‘false alarms’) led to a ‘time-out’ period in which 
the trial was paused for ~2-5s. Licking during this time-out period triggered 
additional time-out periods. Licking outside of answer and time-out periods had 
no consequences. 

Behavioural training began after the mice had restricted access to water for at 
least 7 days (1 ml per day). On days with behavioural sessions, mice generally 
obtained all water for the day during the session (approximately 1 ml). Food 
was available ad libitum. The weight and health of the mice were monitored daily. 
After training, mice learned to respond with licking to whisker-object contact 
regardless of the position of the pole (Supplementary Fig. 2). 

The behavioural apparatus was mounted under a custom two-photon micro- 
scope equipped with a high-speed whisker-imaging system, and was controlled 
by an open-source software system (Z. Mainen and C. Brody) running on 
MATLAB (Mathworks) communicating with a real-time system (TDT and 
RTLinux). Imaging sessions started after the mice had completed the initial train- 
ing and reached a performance criterion of 75% correct responses or d’ > 1.8 
Supplementary Fig. 2a). 

During behaviour, whiskers were illuminated with collimated light using a high- 
power light-emitting diode (LED) source (960 nm, Roithner) and condenser optics 
Thorlabs). Images were acquired through a telecentric lens (X0.36, Edmund 
Optics) by a high-speed complementary metal-oxide semiconductor (CMOS) 
camera (EoSense CL, Mikrotron) running at 500 frames per s (480 X 352 pixels; 
resolution, 26 pixels per mm). Image acquisition was controlled by Streampix 3 
Norpix). Synchronization of behavioural trials and two-photon imaging frames 
was achieved using pulses sent from the real-time behaviour system. 


Two-photon calcium imaging from distal dendrites during behaviour. The 
genetically encoded calcium indicator GCaMP3 (ref. 11) was expressed under the 
human synapsin 1 promoter after infection with recombinant adeno-associated 
virus (AAV, serotype 2/1; produced by the University of Pennsylvania Gene 
Therapy Program Vector Core), which was injected stereotaxically to deep layer 
in barrel cortex. For sparse labelling of layer 5 neurons with GCaMP3, AAV 
carrying floxed GCaMP3 was co-injected with a diluted AAV carrying Cre recom- 
binase (~1:10,000-1:5,000). This approach reduces the cell density of GCaMP3- 
expressing neurons without compromising the expression level. Fifteen to twenty 
days after virus injection, mice began training on the tactile detection task and 
two-photon imaging sessions began after the criterion of 70% correct responses 
was reached (5-7 days after the training began). Sessions analysed here occurred 
between days 25 and 40 post infection. 

GCaMP3 was excited at 925 nm (typically 20-40 mW at the back aperture) with 
a Ti:Sapphire laser (Mai Tai, Spectra Physics) and imaged through a Nikon X16, 
0.8-N.A. objective. Emission light passed through a 565 DCXR dichroic filter 
(Chroma Technology) and a ET525/70m-2p filter (Chroma Technology) and 
was detected by a GaAsP photomultiplier tube (10770PB-40, Hamamatsu). 
Images (512 X 128 pixels) were acquired at ~16 Hz using ScanImage software”. 

Subregions that contain primarily GCaMP-positive apical dendritic trunks 

(from layer 5 neurons) at the depth range of 20-400 tum, but not layer 2 or layer 
3 somas, were used to image distal dendrite activity. A set of initial trials were 
usually used to identify imaging fields that show visually identifiable response 
(corresponding to AF/F > 10%) during the sampling period before a full imaging 
and behaviour session was initiated. Because of the broad horizontal extension of 
tuft dendrites, the imaging regions are usually in the proximity of, but not limited 
to, the barrels corresponding to the present whiskers (determined using intrinsic 
signal imaging). In our data, 23% of active dendrites show touch responses (we 
define active dendrites as those that show discernable increase in fluorescence 
during trials, whether this activity is related to touch or not; see Supplementary 
Figs 2 and 3). This is an overestimate of the fraction of the total population, as less 
than 30% (estimated by number of pixels) of all dendrites in a given field of view 
were active (that is, >70% did not show any detectable change in fluorescence; see 
Fig. 2e and Supplementary Fig. 2a, b). We did not include in our analysis any fields 
of view that contained only inactive dendrites (>50% of the examined fields). 
Thus, although our experiments were not designed to provide an unbiased 
estimate of the fraction of active dendrites, our sampling strategy suggests ~5% 
of all layer 5 dendrites showed touch-related Ca”* signals. 
In vivo dendritic patch-clamp recording. All animals used in this section of the 
study were adult male or female (older than P60) GCaMP3-expressing transgenic 
mice (Thy1, line 10, Charles Zuker, Janelia Farm Research Campus). The animals 
were surgically prepared as described above for the chronic imaging experiments, 
except that a half coverslip was used as an optical window and agar (~1%) was 
applied between the dura—brain surface and the coverslip’. In addition, a portion 
of the dura covering the open surface of the neocortex was either surgically 
removed or partially digested by collagenase treatment (1 mgml' applied to 
the dura using filter paper for 4 min followed by a 5-min wash with bovine serum 
albumin (BSA)) before the application of the optical window. Animals were main- 
tained on ~1.5-2% isoflurane saturated with O2 throughout the surgical and 
experimental session. The craniotomy was perfused with artificial cerebrospinal 
fluid (aCSE) (in mM: 125 NaCl, 25NaHCO,, 1.25NaH,PO,, 3KCl, 1.3 CaCl, 
1.0 MgCh, 25 glucose) saturated with 95% O, and 5% CO>. Whole-cell current- 
clamp recordings were made from tuft dendrites of thick-tufted layer 5 pyramidal 
neurons using a two-photon targeting method described below and a Dagan BVC- 
700 amplifier in ‘bridge’ mode, filtered at 1 kHz and digitized at 50 kHz. Pipettes 
had an open tip resistance of 7-10 MQ when filled with (in mM): 134K- 
gluconate, 6 KCl, 10 HEPES, 4 NaCl, 0.3 Tris2-GTP, 4 MgATP, 14 phosphocrea- 
tine, 0.05 Alexa 594. Pipette capacitance was neutralized just before seal formation 
and the bridge balance was typically 50-100 MQ after break-in. 

To target the distal dendrites of GCaMP-expressing layer 5 neurons, the anaes- 
thetized mice were mounted to a head-fixation device under a custom two-photon 
microscope. GCaMP3 was excited at 900 nm (typically ~20mW at the back 
aperture) with a Ti:sapphire laser (Chameleon Ultra II, Coherent) and imaged 
through a Nikon X16, 0.8-N.A. objective. Emission light passed through a 565 
DCXR dichroic filter (Chroma Technology) and a ET525/70m-2p filter (Chroma 
Technology) and was detected by a GaAsP photomultiplier tube (10770PB-40, 
Hamamatsu). Images (512 X 128 pixels) were acquired at either 8 or 16 Hz using 
ScanImage software. Dendrites were selected for recording based on their mor- 
phology (large-diameter dendritic trunk and extensive tuft arborization). To 
ensure the targeting of layer 5 dendrites, the trunk of the dendrite was traced with 
the two-photon microscope to a depth of at least 400 tm below the cortical surface. 
The two-photon microscope was used to target the recording pipette (filled with 
50 um Alexa 594) to secondary or tertiary branch points in the apical tuft. Once 
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in whole-cell mode, resting membrane potential was assessed (range: —55 to 
—49 mV) and various current steps (from 100 to 1,000 pA, 500 ms) were injected 
to evoke plateau potentials or calcium spikes (input resistance = 38 + 0.3 MQ; 
n= 6). During the current injections, Ca** imaging was performed at ~16 Hz 
over ROIs that contained the recorded dendritic arborization as assessed by red 
fluorescence emission from the Alexa 594. Unpaired electrical events (events out- 
side current injection periods) were selected for characterization using a threshold- 
level detection algorithm (threshold = 20 mV above baseline). Unpaired events 
and plateau potential (threshold = —20 mV) peak time, amplitude and duration, 
as well as Ca**-signal latency (interpolated time at which AF/F exceeded noise 
by 2s.d.) peak amplitude and duration were automatically determined using a 
custom detection algorithm (Igor Por). Event durations are full width at half 
maximum for Ca”* signals or half width (electrical). 

Imaging data analysis. Lateral motion in two-photon images recorded during 
behaviour was corrected in two steps (Supplementary Fig. 3b). All frames from a 
behavioural session were first aligned to a target image frame using an efficient 
cross-correlation-based registration algorithm (single-step discrete fourier trans- 
formation (DFT) algorithm)**. The target image was obtained by mean projection 
of image frames from a trial visually identified to contain still frames. After whole- 
frame alignment, within-frame motion artefacts were corrected with a line-by-line 
registration algorithm using the gradient-descent method”. 

For the data set from densely labelled dendrites with sparse signals, independent 
component analysis (ICA) was used to facilitate defining ROIs**. ICA was carried 
out using a published algorithm, FastICA”, after dimensionality reduction using 
singular value decomposition on the data matrix from a fraction of trials (typically 
50 trials) for each imaging field (each trial contains 128 frames with each frame 
containing 128 X 512 pixels). This method generated independent components 
with hot spots well matching dendritic branch structures that showed transients in 
sampled trials (Supplementary Fig. 3a, b). The number of independent compo- 
nents was empirically determined such that further increases in their number 
extracted only noise. ROIs were then manually outlined to include visually distinct 
hot spots. Because belonging to the same independent component does not neces- 
sarily mean belonging to the same dendritic branch or the same cell, multiple ROIs 
were often defined for the same independent component. This method of ROI 
selection was corroborated by an alternative approach using time projection of 
maximum value of pixel intensity above mean, Fiyyax — Fmean» Where F is the time 
series of fluorescence intensity from each pixel. For each ROI, AF/F (%) was 
calculated as (F — Fo) / Fo X 100, where Fo is the mode defined from the histogram 
of F. ROIs with clear spatial cross-talk were visually identified and excluded for 
analysis. ROIs typically showed two response types, with Ca*~ signals occurring 
either during the whisker sampling epoch (during the presence of the pole; Fig. 1c) 
or in the post-sampling epoch (after the pole was removed from the whisker field, 
responses during this period were often less reliable and there was a lack of time 
locking to tractable behavioural variables; Supplementary Fig. 4). 

To estimate the amplitude and duration of individual Ca** transients during 
behavioural sessions, we used a simple events-detection approach, and measured 
the peak AF/F and full width at half maximum of identified Ca”* events. The 
events detection was based on combined thresholding with both amplitude (>4 
standard deviation, estimated using median absolute deviation (MAD)) and rising 
slope (computed from a span of three frames) of AF/F time series from all trials ofa 
given ROI. Ca** transients during in vivo dendritic patch recording were visually 
identified, from which the amplitudes and durations were estimated. Ca”* events 
during behaviour sessions with a full width at half maximum of less than two 
frames (128 ms) were considered to be noise and were rejected. 

For the analysis of active-sensing-related dendritic Ca’ * signals (Fig. 1c-e), only 
ROIs selective to sampling epoch (compared to the post-sampling epoch, P < 0.05, 
Wilcoxon rank-sum test) were used, and ROIs from the same independent com- 
ponents showing strong correlation with each other were treated as redundant, 
and only one of these was used. For the analysis of signal correlation between 
branches (Fig. 2a-d and Supplementary Fig. 9), ROIs were included regardless 
of their trial epoch selectivity, and for a given branch, only one ROI was included. 
For signal correlation in single cells, ROIs were from sparsely labelled imaging 
fields in which branches could be traced to single apical trunks, and reconstructed 
(Fig. 2a) with the custom-written software neuTube™ using imaging stacks 
(512 X 512 resolution, 1 tm per slice) acquired with a two-photon microscope 
at the end of each experiment. For signal correlation under densely labelled con- 
ditions, branches in a small field of view are likely to originate from many different 
cells, and ROIs from branches were included regardless of their origin. For the 
whisker set-point dependence of the stimulus-response relation shown in Fig. 5e, 
we controlled for potential bias introduced by the alignment of the maximum 
value to 0° by randomly permuting the data points for each ROI into 4 bins. A 
paired t-test was used to compare the maximum slope values from these randomly 
permuted data bins with the maximum slope values from the data binned by 
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set point as in Fig. 5e. The maximum slope value shown in Fig. 5e is significantly 
different to those obtained from randomly permuted data (P < 0.0001). 

Details of the method for extracting whisker variables have been described 
previously'*”’. In brief, the whisker angle at the base of the whisker, 0, was 
extracted from tracked whisker trajectories. A line perpendicular to the midline 
was defined as 0 = 0 and protraction corresponded to increasing angles. Whisking 
set point is the low-pass (<6 Hz) filtered whisker-angle time series [0(t)], cap- 
turing mainly temporally averaged whisker positions. The amplitudes of the forces 
in the follicle are proportional to the curvature of the whisker. We used curvature 
change (Ak; Fig. la) at a particular location along the whisker (2-3 mm) as a 
substitute for the mechanical forces acting on the whiskers. Curvature was deter- 
mined using a second-order polynomial fit to the whisker backbone. Automatic 
detection of whisker—pole contact was based on thresholding Ax with the standard 
deviation estimated using MAD from frames with a minimal whisker-pole dis- 
tance for at lease three consecutive frames (6 ms; Fig. la). The total curvature 
change during contacts, which was used as a metric for overall touch-evoked 
sensory drive in a given trial, is computed by summation of the absolute value 
of peak Ax for each contact period. Unless otherwise indicated, data in the text are 
reported as mean + s.d. 

Passive whisker stimulation. Compressed air was delivered through a glass 
pipette pointing towards a single whisker (C2) near the tip. An air puff to the 
whiskers during the anaesthetized condition is ineffective in evoking any dendritic 
response (Supplementary Fig. 10). Air puff during the awake condition occasion- 
ally evoked dendritic responses. Dendrites often show a distinct response to air 
puff compared to active touch, with a lower threshold response to active touch 
(Supplementary Fig. 10d, e). A small fraction (6%) of dendritic branches 
responded to both air puff and active touch (Supplementary Fig. 10f). To control 
for the potential spread of air puff to the whisker pad and the animal’s face, the 
whiskers were trimmed at the end of experiments, and the air puff was directed 
towards the face. The same dendrites that had showed a response to air puff 
directed at the whiskers did not show any detectable responses to air puff directed 
at the whisker pad or face (data not shown). 

Reversible silencing. To reversibly silence vM1, the GABA (y-aminobutyric acid) 
agonist muscimol hydrobromide (Sigma-Aldrich) was dissolved in saline (5 ug ul), 
and 60 nl were injected slowly (10 nl per min) to vM1 (1.1 mm lateral to midline 
and 0.9 mm anterior to Bregma, depth 550-600 jum under the pia) of the ipsilateral 
side of the S1 imaging window”. For control-site injection, the same dose of 
muscimol was injected to a control cortical area of the same hemisphere with 
the distance to the centre of imaging window the same as that for M1 injection 
(1.1 mm lateral, 3.4mm posterior to Bregma, depth 550-600 um). The animals 
were left to recover for 2h before the behavioural session. To test the possibility 
that the feedback from motor cortex to S1 is also required for air-puff-evoked 
dendritic response during the awake condition, dendritic calcium imaging with 
air-puff stimulation was also conducted on the day of muscimol injection to 
either M1 or to the control site. M1 silencing led to dropped task performance, 
but the animal drank normally when water was freely available, suggesting that M1 
inactivation did not impair licking function. 

In vitro slice preparation and patch-clamp recording. Under control condi- 
tions, recordings were made from layer 5B pyramidal neurons visualized in near- 
coronal brain slices (300 um) of the somatosensory neocortex prepared from 
Wistar rats (4-8 weeks old). Slices were perfused with a solution of composi- 
tion (mM): 125NaCl; 25NaHCO;; 3KCI; 1.25NaH,PO,; 2(or 1.3) CaCl; 
1 MgCl;, 3 Na pyruvic acid and 25 glucose at 35 to 37°C bubbled with 95% O, 
and 5% CO . Dual whole-cell recordings were made with identical current-clamp 
amplifiers (BVC-700A; Dagan Corporation). Pipettes were filled with (mm) 
135 K-gluconate; 7 NaCl; 10 HEPES; 2 Na-ATP; 0.3 Na-GTP; 2 MgCl, and 0.01 
Alexa Fluor 568 (Molecular Probes, Eugene) (pH 7.3-7.4; KOH) and had open-tip 
resistance of 8-12 MW. Current and voltage signals were low-pass filtered (DC to 
10 KHz) and acquired at 30-50kHz. Data were acquired and analysed using 
AxographX software (AxographX). For each neuron, the apical dendritic record- 
ing pipette was positioned as close as possible to the nexus of the apical dendritic 
trunk (21 + 9mm, n = 13) anda second pipette positioned at an apical dendritic 
tuft site under infrared differential interference microscopy. At the termination 
of each whole-cell recording, the location of recording pipettes and neuronal 
morphology was examined by fluorescence microscopy and digitally recorded 
(Retiga EXI, QImaging). 

For in vitro recordings from GCaMP-expressing layer 5 pyramidal neurons, 
acute coronal slices containing the barrel and/or somatosensory cortices were 
prepared from 4-7-week-old male wistar rats (Charles River, OGB-6F experi- 
ments) or from 6-8-week-old male wistar rats injected with GCaMP3 virus at 
3-3.5 weeks of age. Animals were deeply anaesthetized with isoflurane saturated 
with O, and decapitated; the brain was rapidly removed and sectioned at 300 um 
with a Leica VT1200S vibrating tissue slicer in ice-cold slicing aCSF (in mM: 
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125 NaCl, 25 NaHCOs, 1.25 NaH2PO,, 3 KCl, 1.0 CaCl, 5.0 MgCl, 25 glucose, 
3 pyruvate, 1 ascorbate) saturated with 95% O, and 5% COs. Slices recovered 
at 34°C for 20-30min and were subsequently stored at room temperature 
(20-25 °C) until use. Whole-cell current-clamp recordings were made from 
thick-tufted layer 5 pyramidal-neuron distal apical trunks or primary tuft den- 
drites using conventional dodt optics with a Dagan BVC-700 amplifier in ‘bridge’ 
mode, filtered at 1-5 kHz and digitized at 50 kHz. Experiments were performed at 
34-37 °C in aCSF containing (in mM): 125 NaCl, 25 NaHCOs, 1.25 NaH2PO,, 
3 KCl, 1.3 CaCl, 1.0 MgCl, 25 glucose, 3 pyruvate, 1 ascorbate. Pipettes had an 
open tip resistance of 4-6 MQ when filled with (in mM): 134 K-gluconate, 6 KCl, 
10 HEPES, 4 NaCl, 0.3 Tris2GTP, 4 Mg, ATP, 14 phosphocreatine, 0.05 Alexa 594. 
After pipette capacitance neutralization and bridge balance (typically between 
12 and 24MQ after break-in, experiments were terminated if series resistance 
(Rs) exceeded 40 MQ) current steps were injected to characterize basic response 
properties. Dendrites were only accepted for recording if they exhibited prominent 
h-current sag (H-sag; 40-80%), a resting membrane voltage of more negative than 
-53 mV, and a short-latency, rapidly rising voltage spike that crossed 0 mV in 
response to large depolarizing current injection (‘plateau potentials’ or ‘calcium 
spikes’, rheobase of 0.8 to 2.0 nA). All patched dendrites were confirmed to origin- 
ate from intact somas in layer 5. Synaptic stimulation was carried out in the presence 
of 2.5 uM GABAzine (SR95331) with a bipolar stimulating electrode placed in layer 
1 (~50-100 tum below the pia and ~200-400 tm lateral to the dendritic patch 
pipette). Each synaptic stimulation experiment imaged at least two separate 
branches to confirm widespread back-propagation of the plateau potential. 

In vitro two-photon imaging and uncaging. Two-photon imaging and uncaging 
was carried out using a dual galvanometer-based scanning system (Prairie Tech- 
nologies) coupled with two pulsed, ultra-fast Ti:sapphire lasers (Chameleon Ultra 
II, Coherent) with their intensity controlled separately by two electro-optical 
modulators (Conoptics). Alexa 594 was excited at 880 nm to visualize morphology 
while OGB-6F (100 1M) and GCaMP3 were excited at 92nm with line-scans 
through dendrites of interest at high magnification with dwell times of 8-12 1s 


at 150-500 Hz. GCaMP3 and OGB-6F signals are expressed as AF/F (%) (calcu- 
lated as ((F—Fyasetine) / Foaseline) X 100). Between 3 and 12 individual line-scans 
were averaged for each condition. Care was taken to collect data only from 
dendrites that were at least 30 tm (and up to 150 um) below the surface of the 
slice that were not prematurely cut off before termination: we consistently 
observed morphological abnormalities and significantly higher resting AF/F along 
with substantial plateau-evoked GCaMP or OGB-6F signals in branches that did 
not meet these criteria (data not shown), presumably owing to the effects of 
structural damage on their electrotonic architecture. Branches were anatomically 
defined as incrementing from 0° at the apical trunk. For fast, multi-site glutamate 
uncaging, 10 mM MNI-glutamate or 1-5 mM RUBI-glutamate (Tocris; dissolved 
in aCSF) was delivered via pressure ejection through a specially designed puffer 
pipette to the surface of the slice while focused 720-nm (MNI) or 800-nm (RUBI) 
laser light was rapidly directed to a sequence of pre-selected points near spine 
heads (10-20 points over 20-30 j1m, 0.2 ms dwell time, 0.1 ms move time). 


31. Komiyama,T. etal. Learning-related fine-scale specificity imaged in motor cortex 
circuits of behaving mice. Nature 464, 1182-1186 (2010). 

32. Pologruto, T.A., Sabatini, B. L. & Svoboda, K. Scanlmage: flexible software for 
operating laser scanning microscopes. Biomed. Eng. Online 2, 13 (2003). 

33. _ lyer, V. et al. Scanimage for in vivo laser scanning microscopy Program No. 485.2. 
2009 Neuroscience Meeting Planner (Society for Neuroscience, 2009). 

34. Guizar-Sicairos, M., Thurman, S. T. & Fienup, J. R. Efficient subpixel image 
registration algorithms. Opt. Lett. 33, 156-158 (2008). 

35. Greenberg, D. S. & Kerr, J. N. D. Automated correction of fast motion artifacts for 
two-photon imaging of awake animals. J. Neurosci. Methods 176, 1-15 (2009). 

36. Mukamel, E.A., Nimmerjahn, A. & Schnitzer, M. J. Automated analysis of cellular 
signals from large-scale calcium imaging data. Neuron 63, 747-760 (2009). 

37. Hyvarinen, A. & Oja, E. Independent component analysis: algorithms and 
applications. Neural Netw. 13, 411-430 (2000). 

38. Zhao, T. et al. Automated reconstruction of neuronal morphology based 
on local geometrical and global structural models. Neuroinformatics 9, 247-261 
(2011). 


©2012 Macmillan Publishers Limited. All rights reserved 


| sid ial Be 


doi:10.1038/nature11603 


Interleukin receptor activates a MYD88-ARNO-ARF6 
cascade to disrupt vascular stability 


Weiquan Zhu'*, Nyall R. London!*3*, Christopher C. Gibson”**, Chadwick T. Davis’, Zongzhong Tong®, Lise K. Sorensen”, 
Dallas S. Shi®®, Jinping Guo”, Matthew C. P. Smith'?, Allie H. Grossmann”, Kirk R. Thomas’? & Dean Y. Lib?*?!° 


The innate immune response is essential for combating infectious 
disease. Macrophages and other cells respond to infection by 
releasing cytokines, such as interleukin-1B (IL-1B), which in turn 
activate a well-described, myeloid-differentiation factor 88 
(MYD88)-mediated, nuclear factor-kB (NF-«KB)-dependent tran- 
scriptional pathway that results in inflammatory-cell activation 
and recruitment'*. Endothelial cells, which usually serve as a 
barrier to the movement of inflammatory cells out of the blood 
and into tissue, are also critical mediators of the inflammatory 
response”®. Paradoxically, the cytokines vital to a successful immune 
defence also have disruptive effects on endothelial cell-cell inter- 
actions and can trigger degradation of barrier function and dis- 
sociation of tissue architecture’’. The mechanism of this barrier 
dissolution and its relationship to the canonical NF-KB pathway 
remain poorly defined. Here we show that the direct, immediate 
and disruptive effects of IL-1B on endothelial stability in a human 
in vitro cell model are NF-KB independent and are instead the 
result of signalling through the small GTPase ADP-ribosylation 
factor 6 (ARF6) and its activator ARF nucleotide binding site 
opener (ARNO; also known as CYTH2). Moreover, we show that 
ARNO binds directly to the adaptor protein MYD88, and thus 
propose MYD88-ARNO-ARF6 as a proximal IL-1 signalling 
pathway distinct from that mediated by NF-kB. Finally, we show 
that SecinH3, an inhibitor of ARF guanine nucleotide-exchange 
factors such as ARNO, enhances vascular stability and significantly 
improves outcomes in animal models of inflammatory arthritis 
and acute inflammation. 

A defining characteristic of the cytokine-induced inflammatory 
response is the destabilization of endothelial barriers resulting in vas- 
cular permeability’ °. To dissect the pathway(s) involved in this tissue 
disruption, we treated cultured monolayers of human dermal micro- 
vascular endothelial cells (HMVEC-d) with IL-18, and detected an 
increase in endothelial permeability within 15min (Fig. la). The 
canonical IL-1 pathway involves ligand-stimulated activation of 
interleukin-1 receptor (IL-1R), which recruits MYD838 to its cytoplas- 
mic tail’®. The subsequent signalling cascade through IRAK1 results in 
the phosphorylation of IkB-« by the IkB kinase (IKK) complex, lead- 
ing to translocation of NF-KB to the nucleus and the eventual tran- 
scription of target genes that promote inflammatory-cell responses** 
(Supplementary Fig. 1). To test the involvement of this pathway in IL- 
1f-induced vascular permeability, cells were treated with the IKK 
inhibitor SC-514 (ref. 11). Although SC-514 prevented IL-1B-induced 
nuclear localization of NF-«B, it was unable to rescue either IL-1f- 
induced permeability or disruption of vascular endothelial (VE)- 
cadherin surface localization (Fig. 1b, c and Supplementary Fig. 2a-c). 
We also wondered whether IL-1B-induced vascular permeability required 
other known MYD88-mediated downstream signalling mechanisms, 


including ERK1/2, p38 and JNK (also known as MAPK3/MAPK1, 
MAPK14 and MAPK8, respectively)’*”*. Although ERK1/2, p38 and 
JNK were activated by IL-1 stimulation of endothelial cells, small- 
molecule inhibitors of each of these pathways were unable to prevent 
IL-1B-induced vascular permeability or IL-1B-induced disruption 
of VE-cadherin cell-surface localization (Supplementary Fig. 2d-h). 
Although specific NF-«B targets, such as VEGFA, COX-2 (also known 
as PT'GS2) and the COX-2 product prostaglandin E, are modulated by 
IL-1, their activation had no effect on IL-1f-induced endothelial 
permeability*’*’° (Supplementary Fig. 3a-e). Finally, treatment with 
actinomycin D or cycloheximide effectively inhibited transcription or 
translation, respectively, of NF-«B targets, but did not blunt IL-1B- 
induced permeability (Fig. 1d and Supplementary Fig. 3f, g). These 
data strongly support a role for the immediate and destabilizing effects 
of IL-1B on endothelial stability through signalling pathways inde- 
pendent of NF-«B, transcription and translation. 

IL-18 can disrupt VE-cadherin cell-surface localization by promo- 
ting endocytic internalization’. We proposed that IL-1f might use 
ARF6, a known regulator of adherens protein localization'®’”’. 
Indeed, IL-1B activated ARF6 in HMVEC-d within 1 min, a response 
accompanied by increased endocytosis of VE-cadherin within 5 min 
and an increase in monolayer permeability within 15 min (Fig. la, e 
and Supplementary Fig. 4a, b). IL-1B treatment did not affect total VE- 
cadherin messenger RNA or protein levels (Supplementary Fig. 4c, d). 
Adenoviral-mediated overexpression of constitutively active ARF6 
(ARF6(Q67L))'* elicited a dose-dependent increase in endothelial per- 
meability, as well as a disruption of VE-cadherin cell-surface locali- 
zation (Fig. 1f, g and Supplementary Fig. 4e, f). A similar dose- 
dependent loss of total VE-cadherin was also observed, probably 
through internalization and subsequent degradation (Supplementary 
Fig. 4g). Interestingly, at lower doses of adenovirus at which permea- 
bility was still induced, loss of total VE-cadherin was not observed, but 
a dose-dependent loss of cell-surface VE-cadherin occurred (Fig. 1f-h 
and Supplementary Fig. 4g). Moreover, short interfering RNA 
(siRNA) knockdown of ARF6 enhanced VE-cadherin cell-surface 
localization and prevented both IL-1f-induced disruption of VE- 
cadherin and IL-1B-induced endothelial permeability (Fig. 1i-k and 
Supplementary Fig. 4h, i). Collectively, these data link ARF6 as a 
critical regulator of VE-cadherin trafficking by controlling cell-surface 
localization and the immediate and disruptive effects of IL-1B-induced 
vascular permeability. 

The ARF6 activation state is decreased through interaction with 
GTPase-activating proteins (GAPs) and increased through interaction 
with guanine nucleotide-exchange factors (GEFs). Consistent with 
this, treatment of endothelial cells with the ARF-GAP inhibitor 
QS11 evoked an increase in ARF6-GTP, a decrease in VE-cadherin 
cell-surface localization and increased permeability (Fig. 2a-d and 
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Figure 1 | Immediate effects of IL-1B are NF-«B independent and ARF6 
dependent. a—d, Monolayers of HMVEC-d stimulated with IL-1 and assayed 
for permeability to horseradish peroxidase (HRP) over time (a), permeability to 
HRP following 2-h treatment with SC-514 (b), immunofluorescent localization 
of VE-cadherin (c) and permeability to HRP after 30-min treatment with 
actinomycin D or cycloheximide (d). e, IL-1P-stimulated HMVEC-d lysates 
were precipitated with glutathione S-transferase (GST)-GGA3 and 
immunoblotted for ARF6. f-i, HMVEC-d monolayers infected with adenovirus 
(Ad) containing empty vector (null), green fluorescent protein (GFP) or 
ARF6(Q67L), showing permeability (f) and VE-cadherin localization (g) (green 
denotes ARF6 expression). Membrane fractions from adenoviral-vector- 
infected (h) or ARF6-siRNA-treated (i) HMVEC-d were immunoblotted for 
VE-cadherin. j, k, ARF6-siRNA-treated HMVEC-d stimulated with IL-1B, 
showing VE-cadherin localization (j) (arrows denote disrupted VE-cadherin 
cell-surface localization) and permeability (k). A.U., arbitrary unit; DMSO, 
dimethylsulphoxide; IB, immunoblot; TCL, total cell lysate. n = 3; error 

bars denote s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001. 


Supplementary Fig. 5a, b)'®. We noted that a class of ARF-GEFs, the 
cytohesins, is highly expressed in multiple types of endothelial cells 
(Supplementary Fig. 5c). Accordingly, treatment of HMVEC-d with 
SecinH3, a cytohesin inhibitor, significantly increased endothelial cell- 
surface localization of VE-cadherin (Supplementary Fig. 5d-f)”°. 
Notably, SecinH3 inhibited IL-1B-induced ARF6-GTP, as well as 
IL-1B-induced disruption of VE-cadherin cell-surface localization and 
endothelial permeability (Fig. 2e-g and Supplementary Fig. 5g, h). 
To determine which GEF might be uniquely involved, we used 
siRNA to knockdown CYTH1, ARNO and CYTH3 (coding for cyto- 
hesins 1, 2 and 3, respectively) and GEP100 (also known as IQSEC1) 
(Supplementary Fig. 6a), and found that only siRNA targeting ARNO 
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Figure 2 | Inhibition of ARF-GAPs and ARF-GEFs affect ARF6 activation 
and VE-cadherin localization. a-d, HMVEC-d treated with QS11, showing 
whole-cell lysates precipitated with GST-GGA3 and immunoblotted for 
ARF6 (a), membrane fractions immunoblotted for VE-cadherin 

(b), immunofluorescence for VE-cadherin (red) (c) and permeability to HRP 
(d). e-g, HMVEC-d treated with IL-1 and SecinH3, showing immunoblotting 
for ARF6 (e), immunofluorescence for VE-cadherin (f) and permeability to 
HRP (g). h-j, HMVEC-d treated with anti-ARNO siRNA and stimulated with 
IL-1, showing permeability to HRP (h), GST-GGA3 precipitation and ARF6 
immunoblotting (i) and immunofluorescence of VE-cadherin (j). Arrow 
denotes less cell-surface VE-cadherin. n = 3; error bars denote s.e.m. *P < 0.05, 
**P <0.01, ***P < 0.001. 


completely blocked IL-1B-induced endothelial permeability and pheno- 
copied the knockdown of ARF6 (Fig. 2h and Supplementary Fig. 6b). 
Furthermore, ARNO siRNA inhibited IL-1f-induced ARF6-GTP 
formation, IL-1B-induced disruption of VE-cadherin cell-cell contacts 
and IL-1B-induced internalization of surface VE-cadherin (Fig. 2i, j 
and Supplementary Fig. 6c, g); all of these effects were rescued by the 
expression of siRNA-resistant ARNO (Supplementary Fig. 6d-g). In 
cells treated with ARNO siRNA, viral expression of siRNA-resistant 
ARNO, but not viral expression of siRNA-resistant ARNO(E156K) 
(carrying a mutation in the Sec7 domain), rescued the disruption of 
IL-1-induced ARF6-GTP formation and permeability’ (Supplemen 
tary Fig. 6d, e). These data demonstrate that ARNO is a critical ARF- 
GEF necessary for IL-1B-induced activation of ARF6 and subsequent 
induction of vascular permeability, but does not rule out the role of 
other GEF family members in similar responses in different cell types 
or in response to different cytokines. 

The signalling components in the NF-KB pathway downstream of 
IL-1B-induced activation of IL-1R are well characterized (Supplemen- 
tary Fig. 1). Although the inhibition of the NF-KB pathway at the 
level of IRAK1 by siRNA did not inhibit IL-1B-induced ARF6 acti- 
vation, siRNA knockdown of MYD88 inhibited both IL-1B-induced 
permeability and ARF6-GTP activation, suggesting a bifurcation of 
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IL-1B-induced signalling at the point of MYD88 (Fig. 3a-c and 
Supplementary Fig. 7a, b). The proposed bifurcation was further 
verified by the pharmacological uncoupling of the two pathway arms: 
although SecinH3 blunted IL-1B-induced permeability, it did not 
significantly inhibit IL-1B-induced NF-«B nuclear localization or 
NF-«B-dependent expression or localization of cell-surface adhesion 
molecules (Fig. 3d and Supplementary Fig. 7c, d). Furthermore, 
SecinH3 was unable to inhibit IL-1B-induced polymorphonuclear 
leukocyte rolling and adherence under shear stress on an endothelial 
monolayer (Supplementary Fig. 7e). Mechanistic support for this 
novel signalling arm was provided by the demonstration of an inter- 
action between MYD88 and ARNO by co-immunoprecipitation in 
both overexpression and endogenous settings (Fig. 3e, f). Our hypo- 
thesis that ARNO is the critical GEF in IL-1B-induced permeability in 
endothelial cells was further strengthened by our inability to detect an 
interaction between MYD88 and other potentially relevant ARF-GEFs 
including CYTH1, CYTH3 and GEP100 (Supplementary Fig. 7f, g). 
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Figure 3 | The immediate IL-1P-induced permeability pathway diverges at 
MYD88. a, IRAKI-siRNA-treated HMVEC-d, stimulated with IL-1, 
subjected to ARF6-GTP pull-down and immunoblotted for ARF6. 

b, c, MYD88-siRNA-treated HMVEC-d stimulated with IL-1, showing 
permeability and ARF6 activation. d, NF-KB p65 (also known as RELA) 
immunofluorescence in HMVEC-d stimulated with IL-1 and SecinH3. 
Arrows denote nuclear localization. e, Cell lysates from Ad-ARNO-MYC- 
infected HMVEC-d immunoprecipitated (IP) with anti-MYC antibodies and 
immunoblotted with anti-MYD88 antibodies. f, Lysates from HMVEC-d 
immunoprecipitated with anti- MYD88 antibodies and immunoblotted with 
anti-ARNO antibodies. n = 3. Error bars denote s.e.m. *P < 0.05, **P< 0.01, 
***P < 0.001. 
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An effective therapeutic strategy to combat numerous inflammatory 
conditions is to target pro-inflammatory cytokines proximal to the NF- 
kB pathway. However, this strategy can result in undesired pleiotropic 
effects. We wanted to know whether targeting a single arm in this 
pathway—the one mediated by ARNO-ARF6—could inhibit acute or 
chronic inflammation in vivo in two animal models of inflammation. 
The first model we tested was rheumatoid arthritis, a disease characte- 
rized by a dysregulated cytokine response causing excessive inflam- 
mation and tissue damage and treated therapeutically in humans with 
the anti-cytokine tumour necrosis factor receptor (TNFR)-Fc fusion 
protein etanercept (Enbrel)***°. A standard animal model of arthritis 
through which a TNFR fusion approach has been proven effective is 
collagen-induced arthritis (CIA)****. Exposure of animals to the cyto- 
hesin inhibitor SecinH3 after the onset of CIA reduced vascular per- 
meability in the joints, but had no effect on global cytokine levels at 24h 
after treatment initiation (Fig. 4a and Supplementary Fig. 8a, b). In 
addition, a significant inhibition in the increase in arthritic index, 
comparable to that achieved by treatment with Enbrel, was observed. 
The arthritic index is a scoring system determined by the number of 
digits or joints that are oedematous or erythematous. The significance 
of our findings was verified by histologic scoring of inflammation, 
pannus development, cartilage damage and bone damage (Fig. 4c, 
d). A similar effect of SecinH3 was confirmed in a second model of 
inflammation, the carrageenan air-pouch model. Six hours after an 
inflammatory stimulus, a time at which substantial inflammation was 
induced in the positive control mice, treatment with SecinH3 decreased 
exudate volume as well as leukocyte concentration in the exudates 
(Supplementary Fig. 8c, d). Collectively, these data identify MYD88- 
ARNO-ARF6 as a valid target for inflammatory conditions confirming 
a relevant role for manipulation of this pathway in vivo to modulate 
inflammatory processes and in the treatment of disease. 
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Figure 4 | Inhibition of ARF-GEFs decreases collagen-induced vascular 
permeability and arthritis in mice. a, Arthritis-induced vascular permeability 
in the joint measured by Evans Blue leak 7 days after treatment initiation in the 
presence of SecinH3 or Enbrel. n = 14 per group. b, Chronological arthritic 
assessment. n = 10 per group. Significance values are measured against CIA 
plus vehicle. c, Haematoxylin and eosin staining of sections through joints. Bn, 
bone; HC; hyaline cartilage; JS, joint space; Pn, inflamed pannus. Asterisk 
denotes cartilage and bone loss; arrows denote eroded cartilage. d, Summary of 
histological changes of inflammation, pannus, cartilage and bone damage of 
indicated treatment. Control, n = 5; Enbrel and SecinH3, n = 10. Error 

bars denote s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 versus 
disease group. 
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Chronic inflammation causes tissue destruction through dysregu- 
lated cytokine release, inflammatory cell recruitment and vascular 
permeability; yet each of these mechanisms has critical roles in many 
physiologic processes, including the immune response*’. We have 
identified a novel pathway that uncouples cytokine effects on vascular 
stability from other critical functions of the canonical NF-«B tran- 
scription program (Supplementary Fig. 1). Our model suggests the 
potential for inhibition of vascular leak without modulation of 
immune-cell adhesion or other critical NF-«B-dependent responses. 

The activation of many inflammatory cytokine receptors disrupts 
cell-cell interactions, precipitating tissue oedema and destruc- 
tion®”””’, Toll-like receptors and the interleukin receptor also use 
MYD88, and the mechanism described here may well apply*"®. 
Interestingly, TNFR1 (also known as TNFRSF1A) does not use 
MYD838 yet still activates ARF6-GTP after stimulation (Supplemen- 
tary Fig. 9). Whether ARNO or another ARF-GEF binds directly to 
TNFR1 or its adaptor protein TRADD is unknown, as is the possibility 
that ARF-GEF-ARF6-cadherin serves as a common signalling mo- 
dule exploited by multiple cytokines. Although this study focused on 
the endothelium, the concept of cytokine receptor-ARF-cadherin 
may also apply to the epithelial barrier, which expresses these consti- 
tuents and is also compromised by cytokines including IL-1f"*. 

Inhibition of this novel vascular-stability pathway, which is aimed at 
enhancing the resilience of the host to the cytokine response, shows 
effects commensurate to those of class-leading drugs that target cyto- 
kines upstream of NF-«B and seek to blunt the cytokine response of 
the immune system outright. This approach may be particularly useful 
in arthritis, as the current medical therapy can render a patient immu- 
nocompromised and susceptible to reactivation of infectious disease 
such as tuberculosis”. Application of these findings to other diseases 
characterized by excessive acute or chronic inflammatory states, 
including sepsis, Crohn’s disease, ulcerative colitis, scleroderma and 
psoriasis, should also be considered®”?”*. 


METHODS SUMMARY 
Transwell permeability. HMVEC-d cells were seeded on 1.0-j1m Costar transwell 
inserts coated with fibronectin. Cells were grown to confluency and treated with 
SecinH3 for 3 h or MAPK/NF-«B/transcription/translation inhibitors for 30 min 
followed by treatment with 10 ng ml" IL-1. Alternatively, cells were infected with 
Ad-GFP or Ad-ARF6(Q67L) for 48h. siRNA knockdown was performed as 
described in Supplementary Methods, and cells were treated with IL-1 72 h after 
the second siRNA transfection. Two hours later, HRP was added to the top chamber 
ata final concentration of 100 pg ml’. Medium was removed after 60 min from the 
lower chamber. For time-course transwell assays and transcription/translation- 
inhibitor experiments (Fig. 1a, c, d), HRP was added to the insert at the same time 
as IL-1B. Transwell inserts were moved to fresh wells after each time point, and the 
concentration of HRP in the bottom chamber was measured for monolayer per- 
meability. HRP was assessed using media samples obtained from the lower chamber 
incubated with 0.5 mM of guaiacol and 0.6 mM H,0O3. Spectrophotometric analysis 
of absorbance at 490 nm provided a quantitative evaluation of the amount of HRP 
that crossed the membrane. Data are presented as mean ~ s.e.m. of at least three 
independent experiments performed in quadruplicate. 

A detailed description of all methods is provided in Supplementary Information. 
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A soybean cyst nematode resistance gene points to a 
new mechanism of plant resistance to pathogens 
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Soybean (Glycine max (L.) Merr.) is an important crop that pro- 
vides a sustainable source of protein and oil worldwide. Soybean 
cyst nematode (Heterodera glycines Ichinohe) is a microscopic 
roundworm that feeds on the roots of soybean and is a major 
constraint to soybean production. This nematode causes more 
than US$1 billion in yield losses annually in the United States 
alone’, making it the most economically important pathogen on 
soybean. Although planting of resistant cultivars forms the core 
management strategy for this pathogen, nothing is known about 
the nature of resistance. Moreover, the increase in virulent popula- 
tions of this parasite on most known resistance sources necessitates 
the development of novel approaches for control. Here we report 
the map-based cloning of a gene at the Rhg4 (for resistance to 
Heterodera glycines 4) locus, a major quantitative trait locus con- 
tributing to resistance to this pathogen. Mutation analysis, gene 
silencing and transgenic complementation confirm that the gene 
confers resistance. The gene encodes a serine hydroxymethyltrans- 
ferase, an enzyme that is ubiquitous in nature and structurally 
conserved across kingdoms. The enzyme is responsible for intercon- 
version of serine and glycine and is essential for cellular one-carbon 
metabolism. Alleles of Rhg4 conferring resistance or susceptibility 
differ by two genetic polymorphisms that alter a key regulatory pro- 
perty of the enzyme. Our discovery reveals an unprecedented plant 
resistance mechanism against a pathogen. The mechanistic know- 
ledge of the resistance gene can be readily exploited to improve nema- 
tode resistance of soybean, an increasingly important global crop. 

The first quantitative trait loci (QTL) for resistance to Heterodera 
glycines (rhg) were identified in the early 1960s*”. Resistance QTL on 
chromosomes 18 (rhg1) and 8 (Rhg4) have been consistently mapped 
in a variety of soybean germplasm and represent the major sources of 
resistance in soybean cultivars’. In the soybean cultivar (cv.) Forrest, 
resistance to soybean cyst nematode (SCN) requires both rhg] and 
Rhg4, with Rhg4 exhibiting dominant gene action’. Roots of plants 
carrying Rhg genes are penetrated by infective juveniles, but feeding 
cells ultimately degenerate, causing the nematodes to die before reaching 
adult stages®. Genetic variability in H. glycines is prevalent, and nema- 
todes that survive on resistant cultivars carry the undefined ror (repro- 
duction on a resistant host) alleles’, leading to population shifts in the 
field as a consequence of resistant soybean monoculture’. So far, our 
understanding of resistance to SCN remains limited because the genes 
underlying resistance QTL have not been cloned”"®. 

We report here the positional cloning of the Rhg4 gene from soybean 
cv. Forrest (Fig. 1a). For this purpose, three F,., recombinant inbred 
line (RIL) populations segregating for SCN resistance were developed. 
From a total of 355 recombinant lines identified with chromosomal 
breakpoints at the Rhg4 locus, two recombinants (ExF74 and FxW5093) 
were crucial in defining the interval carrying the Rhg4 gene. Both lines 


carry the resistance allele at the rhg1 locus and are double recombi- 
nants for an 8-kilobase (kb) interval carrying the Rhg¢ resistance allele. 
Two genes, one coding for a serine hydroxymethyltransferase (SHMT) 
and the other a subtilisin-like protease (SUB1), were identified in the 
interval. An alignment of SUB1 cDNA sequences from Forrest, Essex 
and Williams 82 cultivars indicates that the two amino acid differences 
between Forrest and Essex do not correlate with SCN resistance 
(Supplementary Fig. 1). In addition, no nucleotide differences were 
identified in the 1,766 base pairs (bp) of sequence 5’ of the predicted 
translational start site for SUB1 between Forrest and Essex. A compar- 
ison between the genomic DNA sequences of SHMT from Forrest and 
Essex identified five nucleotide differences (three single nucleotide 
polymorphisms (SNPs) and two insertions/deletions (indels)) between 
the resistant and susceptible alleles (Fig. 1a). Two of the nucleotide 
differences found between the Forrest and Essex SHMT cDNAs result 
in an amino acid change in the predicted protein sequences (R130P 
and Y358N) (Fig. 1a). Furthermore, three nucleotide differences were 
identified in the 2,339 bp of sequence 5’ of the predicted start site for 
SHMT between Forrest and Essex (Supplementary Fig. 2a). On the 
basis of these findings, SHMT was characterized further for a role in 
SCN resistance. 

Using TILLING’””, we identified two Forrest mutants, F6266 and 
F6756, carrying mutations in the SHMT gene on chromosome 8 that 
lead to missense mutations at amino acid positions 61 (E61K) and 125 
(M125]), respectively (Fig. 1b). The M125I mutation is predicted to be 
deleterious (sorting intolerant from tolerant (SIFT) score = 0.02) to 
the protein. Both mutants are more susceptible to SCN (Fig. 1c). In the 
segregating M3 F6756 (M125) mutant plants, the mutation is directly 
correlated with the SCN resistance phenotype of individual plants. 

To establish a link between SHMT alleles and soybean resistance to 
SCN, we scored 81 soybean lines representing 90% of the sequence 
diversity in soybean’* for SCN female index and then determined their 
SNP-based SHMT haplotype (Supplementary Table 1). The SHMT 
gene was fully sequenced from 28 selected soybean lines (Fig. 2). 
Eight different SHMT haplotypes were identified. Soybean lines with 
haplotypes H1-H3 carry resistant alleles at SHMT and rhgl and are 
resistant to SCN. These include soybean lines PI 548402 (Peking), 
Forrest, PI 90763, PI 437654 and PI 89772, all of which exhibit 
‘Peking-type’ resistance, which requires both rhg] and Rhgé (ref. 5). 

Further evidence that the SHMT gene confers SCN resistance comes 
from knockdown studies using virus-induced gene silencing (VIGS) 
and targeted RNA interference (RNAi). We cloned 328 bp of SHMT 
into bean pod mottle virus (BPMV) RNA-2 (ref. 14) and generated 
infected tissue. Silencing of the SHMT gene in the SCN-resistant RIL 
ExF67 by inoculation with BPMV-SHMT results in an increase 
in susceptibility to SCN compared to ExF67 inoculated with BPMV 
only (Fig. 3a; P<0.0001). At the time of nematode inoculation, 
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Figure 1 | Rhg4 positional cloning and functional validation of SHMT by 
TILLING. a, Top: high-density genetic map of the Rhg4 locus showing double 
recombinants ExF74 and FxW5093 using Williams 82 sequence as reference. 
Red arrows represent DNA markers with Forrest alleles. BAC clone 100B10 
encompasses a partial SHMT sequence. Middle: SHMT gene model and 
polymorphisms between Forrest and Essex. Bottom: SHMT predicted protein 
sequence showing amino acid differences (R130P and Y358N) between Forrest 
and Essex. b, Identification of two missense TILLING mutations (E61K and 
M1251) within the SHMT. MC, median sequence conservation. c, SCN 
phenotype of the TILLING mutants. FI, female index. Bars, s.e.m. 


SHMT transcript levels were determined by quantitative polymerase 
chain reaction with reverse transcription (qRT-PCR) to have decreased 
by an average of 74% in the roots of plants inoculated with BPMV- 
SHMT compared with those inoculated with BPMV only (Fig. 3b). For 
targeted RNAi, a 338-bp double-stranded RNA (dsRNA) corres- 
ponding to SHMT was expressed under the control of an SCN- 
inducible zinc finger transcription factor promoter’ in soybean hairy 
roots. Nematode reproduction on hairy roots of the SCN-resistant RIL 
ExF67 transformed with pZF-SHMTi is greater than on ExF67 hairy 
roots transformed with pZF-GUSi control (Fig. 3c; P< 0.01). 

Next we fused 2.3 kb of SHMT promoter sequence from Forrest and 
Essex cultivars with the B-glucuronidase (GUS) reporter gene. In unin- 
fected roots, SHMT expression is observed within the vasculature 
of roots (Supplementary Fig. 3). Nematode infection experiments of 
soybean hairy roots of ExF67 and ExF63 transformed with the Forrest 
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pSHMT-GUS construct confirm expression of SHMT within syncytia 
of both lines (Fig. 3d-f). The same pattern of GUS expression is 
observed in nematode-infected soybean hairy roots of ExF63 trans- 
formed with the Essex pSsHMT-GUS construct (Supplementary Fig. 
2b). These data verify SHMT expression in nematode feeding sites of 
both resistant and susceptible soybean and indicate that differences in 
Rhg4 expression are unlikely to explain the resistance phenotype. 

To confirm that SHMT is the resistance gene at the Rhg4 locus, we 
transformed the SCN-susceptible RIL ExF63 with a 5.1-kb genomic 
fragment that includes the Forrest SHMT gene and the 2.3 kb of 
sequence upstream of the start and the 0.57 kb downstream of the stop 
codon (gSHMT). SCN resistance was restored in the complemented, 
transformed hairy roots (Fig. 3g; P< 0.0001), confirming that SHMT 
is the Rhg4 gene. 

To gain insight into the mechanistic basis of resistance, a structural 
model of SHMT was obtained by applying homology modelling. The 
predicted structure was then used to examine how the variant geno- 
types may be affecting the structural and functional properties of 
the enzyme. Forrest P130R and N358Y are co-localized with the 
tetrahydrofolate (THF)/5,10-methylene tetrahydrofolate (MTHF)/ 
5-formyltetrahydrofolate (FTHF) binding site and are in close proxi- 
mity to pyridoxal 5’-phosphate (PLP)-serine (PLS), PLP-glycine (PLG), 
and one of the two glycine binding sites. The position of the E61K 
mutation overlaps with the PLS and THF/MTHF/FTHF binding sites 
(Fig. 4a, b, Supplementary Data and Supplementary Video 1). The 
findings suggest that these residue changes may directly affect the 
reversible interconversion of serine and THF to glycine and MTHF. 
On the other hand, the M125] mutation in the Forrest F6756 TILLING 
mutant is found in an interior B-sheet (Fig. 4a, b), indicating that there 
may be a different mechanism altering the function of SHMT in 
this mutant, perhaps through the structural instability of the region 
affected by the TILLING mutation. 

We further tested the ability of the Forrest, Essex, F6266 and F6756 
Shmt proteins to complement an Escherichia coli glycine auxotroph’®. 
Essex Shmt fully restores growth, Forrest Shmt partially restores 
growth, and the F6266 and F6756 Shmt proteins are unable to restore 
growth of the mutant (Fig. 4c, d). Although there seems to be a dif- 
ference in the ability of the Forrest and Essex Shmt proteins to 
complement the E. coli glycine auxotroph, this finding establishes that 
both the Forrest and Essex alleles encode functional Shmt enzymes. 
Consistent with this observation, kinetic studies show different reaction 
kinetics of the Forrest and Essex Shmt proteins. Essex Shmt reaction 
velocity increases with increasing concentrations of THF, peaks, and 
then declines to a lower rate and remains steady (Fig. 4e). On the other 
hand, Forrest Shmt seems to follow typical Michaelis-Menten kinetics 
(Fig. 4fand Supplementary Fig. 4). The inhibition of Essex Shmt activity 
at high THF concentrations might have regulatory implications in a 
cellular environment’. The residue changes in Forrest Shmt seem to 
change this regulatory property of the enzyme. 

Shmt is a ubiquitous enzyme in nature with a key role in one-carbon 
folate metabolism that is conserved across kingdoms. Although the 
enzyme has multiple catalytic activities, one of its main roles is to 
catalyse the reversible conversion of serine and THF to glycine and 
MTHF to supply one-carbon units for de novo purine, thymidylate and 
methionine synthesis, underlying its importance in DNA synthesis 
and cellular methylation reactions. Consequently, in humans, muta- 
tions in SHMT and folate deficiency have been linked to a wide range 
of disease states'* 7’. 

Transcriptional and metabolic profiling studies of syncytia, the 
feeding cells formed by cyst nematodes in plant roots, support a high 
demand on folate one-carbon metabolism for their development and 
maintenance’; perturbations to which could severely compromise 
their activity. Syncytia induced in plants resistant to SCN degenerate 
by what has been described as a hypersensitive response, a form of 
localized programmed cell death (PCD) in plants to ward off invading 
pathogens. Molecular analysis has identified increased defence-related 
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Figure 2 | Haplotypes identified at SHMT in 28 soybean lines. The SHMT 
coding region for the 28 lines shown here was sequenced. Lines are classified 
resistant (R) to SCN if FI = 10% and susceptible (S) if FI > 10%. Polymorphic 
sites were positioned relative to the first nucleotide of the start codon in Forrest. 
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the SCN phenotype. 
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Figure 3 | Functional validation of SHMT by VIGS, RNAi and 
complementation. a, Nematode reproduction on SCN-resistant RIL ExF67 
inoculated with BPMV or BPMV-SHMT (P < 0.0001). b, qPCR of SHMT 
transcript levels in control and SHMT-silenced roots (mean and s.e. of five 
samples). c, Nematode reproduction on ExF67 hairy root lines transformed 
with a SHMT RNAi construct (P < 0.01). d-f, Forrest SHMT promoter-GUS 
analysis in ExF67 (d, e) and ExF63 (f) showing expression in syncytial feeding 
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cells at 3 days after inoculation with SCN. N, nematode; Syn, syncytium. Scale 
bars, 50 jim. g, SCN reproduction on ExF63 hairy root lines transformed with a 
5.1-kb Forrest SHMT gene fragment (P < 0.001). In graphs, diamonds 
represent the number of cysts on a single root system or hairy root line and the 
bars indicate the mean values. At least three independent experiments were 
performed showing similar results. Data from one experiment are presented. 
Letters indicate a significant difference at the indicated P value. 
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Figure 4 | Modelled structure and biochemical analysis of SHMT. 

a, Homology model of the Essex SHMT homodimer. Forrest polymorphisms, 
P130R and N358Y, are located on the dimer surface; the Forrest F6266 (E61K) 
mutation is buried in the dimerization interface; the Forrest F6756 (M1251) 
mutation is located in the core of each monomeric subunit. b, Split-open view 
of the SHMT dimer with the ligand binding sites located at the interface. 

c, d, Complementation of an E. coli glycine auxotroph (GS245(DE3)pLyss; 


gene expression associated with oxidative stress, apoptotic cell death, 
and the plant hypersensitive response in syncytia formed in resistant 
plants’>. 

Our findings are consistent with the dominant nature of Rhg4- 
mediated resistance and support a model wherein the Forrest Shmt 
has acquired a new or altered function. Computational analysis 
predicts that the polymorphisms in Forrest Shmt reside near ligand- 
binding sites, and our biochemical analyses suggest that these residue 
changes probably impair a key regulatory property of the enzyme. Any 
changes in Shmt function can have wide-ranging affects on one- 
carbon folate metabolism. Alterations to folate homeostasis leading 
to folate deficiency, which can induce apoptosis in mammalian cells*”, 
may trigger hypersensitive-response-like programmed cell death of 
the developing feeding cells and subsequent death of the nematode. 
Alternatively, the Forrest Shmt may lead to the production of a nema- 
ticidal compound or serve as the target of a nematode-secreted small 
molecule or effector protein to trigger a resistance signalling pathway. 
Although the nutritional requirements of plant-parasitic nematodes are 
not well defined, it is assumed that nematodes, like other animals, acquire 
folate from their diet. Thus, the nematode’s nutritional requirements 
may also be influencing folate metabolism in developing feeding cells. 


Tetrahydrofolate (mM) 


shmt—) by the Essex, Forrest, F6266 and F6756 Shmt proteins. Absorbance at 
600 nm (Agogo) of cultures was measured at different time points as shown, 
uninduced (c) or induced (d) with 0.25 mM IPTG. GS245(DE3)pLyssS carrying 
the pET28a vector was used as a control. e, f, Enzyme kinetics of Essex (e) and 
Forrest (f) Shmt proteins. The graphs were plotted as rate of product formation 
per min per mg of enzyme at varying concentrations of tetrahydrofolate and 
2 mM serine. Graphs shown are representative of three experiments. 


Perturbations to the plant’s folate pathway may lead to a nutritional 
deficiency that starves the nematode. Upon nematode death, the loss 
of stimuli required for maintenance of the feeding cells results in 
their degeneration. The work described here provides a foundation 
for future molecular and biochemical studies aimed at achieving a 
mechanistic understanding of how Shmt functions in resistance to 
SCN. 


METHODS SUMMARY 


Three F,.¢ recombinant inbred line (RIL) populations segregating for resistance to 
SCN Hg type 0 were developed from crosses of SCN-resistant cv. Forrest (F) with 
the SCN-susceptible soybean cultivars Williams 82 (W) or Essex (E) and used in 
mapping studies. Because Forrest resistance to SCN requires both rhg1 and Rhg4 
(ref. 5), genotyping was conducted using DNA markers flanking both loci to detect 
informative recombinants at the Rhg4 locus (Supplementary Table 1). The two F>.¢ 
RILs, ExF67 (rhg1prhg1pRhg4pRhg4,) and ExF63 (rhg1prhg1pRhg4,Rhg4p), used 
in functional analyses are resistant and susceptible, respectively, to SCN'®. An 
ethylmethane sulphonate (EMS)-mutagenized M2 population of Forrest contain- 
ing 1,920 M2 families was used to screen for mutations as previously described’. 
VIGS and hairy root assays were carried out as previously described'*"*. A 5,103- 
bp Forrest SHMT genomic DNA fragment (GenBank accession number 
JQ714083) was cloned and sequenced for complementation analysis. Homology 
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modelling was done using MODELLER-9”*. Escherichia coli complementation 
analysis'® and SHMT assays” were performed as described. 


Full Methods and any associated references are available in the online version of 
the paper. 
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Nematode and plant material. The SCN (Heterodera glycines Ichinohe) inbred 
population PA3 (Hg type 0) used in this study was mass-selected on soybean cv. 
Williams 82 according to standard procedures” at the University of Missouri. The 
soybean cultivar Forrest” is resistant to SCN PA3. The soybean cultivars Essex” 
and Williams 82 (ref. 31) are susceptible to SCN PA3. Forrest was used to develop 
an ethyl methane sulphonate (EMS)-mutagenized M2 population of 1,920 lines 
for TILLING”. The two F.¢ RILs, ExF67 (rhglrhg1-Rhg4;Rhg4;) and ExF63 
(rhglrhg1,Rhg4,Rhg4,), used in this study are resistant and susceptible, res- 
pectively, to PA3'°. These two RILs differ at the majority of markers assigned to 
the Rhg4 region and seem to be nearly opposite recombination events. The collec- 
tion of soybean lines used in this study was obtained from the USDA Soybean 
Germplasm Collection, University of Ilinois. 

Map-based positional cloning of the Rhg4 gene. Three genetic populations 
segregating for resistance to SCN PA3 (Hg type 0) were used for mapping. 
These included an F>.¢ RIL population from a cross between Forrest and Essex” 
(98 individuals) and two large F2. RIL populations generated from crosses 
between Forrest and either Essex (1,755 lines) or Williams 82 (2,060 lines), to 
enrich the chromosomal interval carrying the Rhg4 gene with recombinants. SCN 
phenotyping was conducted according to ref. 32. 

Because Forrest resistance to SCN requires both rhg1 and Rhg4 genes°, geno- 
typing was conducted using DNA markers flanking both loci to detect informative 
recombinants at the Rhg4 locus (Supplementary Table 2). The SSR markers 
Satt632, Sat_162 (http://soybase.org) and GMES6186 (ref. 33) were used to 
identify chromosomal breakpoints at the Rhg4 locus. PCR amplifications were 
performed using DNA from individuals from each of the three genetic popula- 
tions. Cycling parameters were as follows: 35 cycles of 94°C 30s, 50°C 30s and 
72°C 30s with 7 min of extension at 72 °C. The PCR products were separated on 
3-4% metaphor agarose gels. The identified recombinants were subject to a second 
screening by using the SSR markers Sat_210 and Satt309 (http://soybase.org) and 
SIUC-SAT143 to identify the rhg1 genotype of each recombinant. 

To enrich the chromosomal regions carrying the Rhg4 locus with DNA markers, 
the GenBank published sequences AX196297 and AX197417 spanning this region 
were used to design PCR primers every 5 to 10 kb of the 300 kb carrying the Rhg4 
locus (Supplementary Table 2). DNA from Forrest, Essex and Williams 82 were 
tested with each primer by using a modified EcoTILLING protocol to find and 
map polymorphic sequences at the Rhg4 locus’®’*. The identified SNP and indel 
DNA markers were integrated into the informative recombinants to identify 
chromosomal breakpoints and the interval that carried the Rhg4 gene. 

The closest DNA markers harbouring the Rhg4 locus were used to screen three 

Forrest BAC libraries**. The BAC clone 100B10 was identified, integrated with the 
developed genetic map, and partially sequenced*"”. 
Isolation of the SHMT genomic and cDNA sequences. A 5,103-bp Forrest 
SHMT genomic DNA fragment (GenBank accession number JQ714083) was 
cloned and sequenced. The fragment spans 2,339-bp of sequence 5’ of the ATG 
start site, 2,189 bp of sequence from start to stop including 3 exon and 2 introns, 
and 0.575 kb of sequence 3’ of the stop codon. Because the BAC clone 100B10 
contains only a partial SHMT gene sequence that includes the 2,339 bp of sequence 
5' of the ATG start site and 1,315 bp downstream of the ATG start site (Fig. 1a), we 
used an internal SacI site at position 108 from the ATG start for a PCR-based 
cloning approach of the full-length genomic sequence. First, a 2,447-bp fragment, 
including the 2,339 bp of sequence 5’ of the ATG start site and 108 bp of exon 1, 
was amplified by PCR using both a forward primer designed with an Ascl site and 
a reverse primer spanning the internal Sacl site. The fragment was digested and 
cloned into the CGT35S vector** by using Ascl and SacI. An Sbfl site was also 
introduced into the forward primer internal to AscI for subsequent subcloning for 
complementation analysis (Supplementary Table 2; see below). The remainder of 
the SHMT gDNA fragment, including the unique internal Sacl site, was amplified 
from Forrest genomic DNA by PCR with a forward primer spanning the internal 
Sacl site and a reverse primer designed with a KpnI site. The fragment was digested 
and cloned into the SacI and KpnI sites downstream of the 5’ fragment in the 
above CGT35S clone. The reverse primer was designed with an Sbfl site internal to 
KpnI for the purpose of subsequent subcloning for complementation analysis 
(Supplementary Table 2). The fragments were ligated together using the internal 
Sacl restriction site to generate the 5,103-bp SHMT genomic DNA fragment and 
were sequence-verified for accuracy. Primers designed to the Forrest genomic 
DNA sequence were used to clone the Essex SHMT genomic DNA sequence. 
PCR primers based on the Forrest and Essex genomic DNA sequences were used 
to amplify the corresponding cDNA sequences. Genomic DNA was isolated from 
young leaves using the DNeasy plant mini kit (Qiagen). Total RNA was isolated 
from roots using the RNeasy plant mini kit (Qiagen), and cDNA was synthesized 
using a CDNA synthesis kit (Invitrogen). 
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Mutation screening of SHMT. Primers specific for SHMT (Supplementary Table 
2) were used to screen a population of 1,920 EMS-mutagenized M2 lines from the 
SCN-resistant cv. Forrest'"'’. The gene was divided into three intervals (Fig. 1b), 
and TILLING was performed as previously described'*. The SHMT gene of each 
mutant was sequenced to characterize the identified allele and its subsequent 
amino acid changes within the predicted protein sequences. SIFT predictions were 
performed on identified mutations. SIFT predicts whether an amino acid substi- 
tution affects protein function based on sequence homology and the physical 
properties of amino acids**. SIFT predictions with median sequence conservation 
(MC) <3.25 are considered confident. Changes with a SIFT score <0.05 are 
predicted to be damaging to the protein. Both missense mutations identified 
had MC <3.25 (Fig. 1b), thus the SIFT predictions can be considered confident. 
Phenotype and zygosity analyses of SHMT TILLING mutants. Mutant seeds 
were planted and scored for their SCN female index as described**. DNA from 
each plant was subjected to TILLING analysis. Wild-type Forrest reference DNA 
was withheld from the reaction tube before mismatch analyses to detect the 
zygosity level of the identified mutant. 
Haplotyping of soybean lines. A total of 81 soybean lines (plant introductions, 
landraces, and elite cultivars) representing 90% of the genetic variability in soybean’? 
were scored for their SCN female index. Lines were classified resistant (R) to SCN if 
the FI was =10% and susceptible (S) if the FI was >10%. Soybean lines were 
genotyped at the Rhg4 locus by using the DNA markers Sat_162 and SUB1 and 
at the rhg1 locus by using the DNA markers 560, 570 and Satt309. The coding 
region of SHMT for 28 lines was sequenced (Fig. 2). Common SNPs and indels 
were identified and used to determine the different SHMT haplotypes. 
Virus-induced gene silencing. Bean pod mottle virus (BPMV, genus Comovirus) 
is an effective virus-induced gene silencing (VIGS) vector for soybean. BPMV hasa 
bipartite positive-strand RNA genome consisting of RNA-1 and RNA-2. We used 
a DNA-based system that places the cDNAs of BPMV genomic RNA-1 and RNA- 
2 under the control of the cauliflower mosaic virus (CaMV 35S) promoter. Two 
BPMV VIGS vectors, pBPMV IA-R1M and pBPMV-IA-D35, were used in this 
study'’. pBPMV-IA-D35 is a derivative of pPBPMV-IA-R2 containing BamHI and 
KpnI restriction sites between the cistrons encoding movement protein and the 
large coat protein subunit. Briefly, a 328-bp fragment (spanning base pairs 210- 
537) of the SHMT cDNA sequence (GenBank accession number JQ714080) was 
amplified from soybean (cv. Forrest) root cDNA by RT-PCR. PCR products were 
digested with BamHI and KpnI and ligated into pBPMV-IA-D35 digested with the 
same enzymes to generate pBPMV-IA-SHMT (BPMV-SHMT). Gold particles 
coated with pBPMV-IA-RIM and pBPMV-IA-SHMT were co-bombarded into 
soybean leaf tissue as described"*. At 3-4 weeks after inoculation, BPMV-infected 
leaves were collected, lyophilized, and stored at —20°C for future experiments. 
Infected soybean leaf tissues were ground in a mortar and pestle with 0.05 M 
potassium phosphate buffer (pH 7.0) and used as virus inoculum for VIGS assays. 
The SCN-resistant RIL ExF67 was inoculated with pBPMV-IA-SHMT. Control 
plants were infected with BPMV only. Each treatment consisted of at least 12 
plants. Unifoliate leaves of 9-day-old plants were rub-inoculated with virus using 
carborundum as described"*. Plants were grown in a growth chamber set to the 
following conditions: 20-21 °C, 16h light/8h dark, and 100 pmol m™*s_* light 
intensity. Twenty-one days after virus inoculation, plants were inoculated with 
1,500 SCN eggs and maintained at 20 °C for 35 days. Cysts were isolated from the 
root systems of individual plants by decanting and sieving and counted under a 
stereomicroscope. The results were plotted and analysed for statistical significance 
by an unpaired t-test using GraphPad PRISM software. To estimate SHMT gene 
silencing in roots, root tissues were collected at 21 days after virus inoculation (the 
time of nematode inoculation) from two representative plants inoculated with 
either pPBPMV-IA-SHMT or BPMV only and frozen at —80 °C for RNA isolation 
and qPCR analysis. 
Hairy root RNAi experiments. A 338-bp fragment (spanning base pairs 205-542) 
of the SHMT cDNA sequence was amplified from soybean (cv. Forrest) root cDNA 
by RT-PCR, cloned into the pDONR/Zeo gateway cloning vector (Invitrogen), 
and subsequently moved to a gateway RNAi binary vector under the control of the 
nematode-inducible Glyma15g04570.1 promoter’* (pZF-RNAi vector) to generate 
pZF-SHMTi. The pZF-RNAi vector was constructed by introducing gateway 
cloning sites flanking the FADR intron downstream of pZF promoter in the 
pAKK vector*, which has a GFP selectable marker in planta. Transgenic ExF67 
hairy roots transformed with pZF-SHMTi were produced from soybean cotyle- 
dons’*. ExF63 and ExF67 hairy roots transformed with pZF-GUSi (the pZF-RNAi 
vector containing a portion of the GUS gene) were used as susceptible and resistant 
controls, respectively. GFP-positive hairy roots were root-tip propagated three 
times on media containing antibiotic to clear Agrobacterium before nematode 
inoculation as described previously”’. Briefly, hairy roots (3-4 cm in length) were 
grown in square Petri plates and infected with approximately 400 sterile infective 
second-stage nematode juveniles (J2s) 1 cm above the root tip. The plates were 
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incubated in the dark at room temperature for 30 days. After 30 days, cysts were 
counted under a stereomicroscope. The experiment was conducted independently 
three times with a minimum of 12 independent hairy root lines per treatment. The 
results were plotted and analysed for statistical significance by an unpaired t-test 
using GraphPad PRISM software. 

Promoter-GUS analysis. A 2,339-bp fragment corresponding to sequence 5’ of 
the ATG start site of the Forrest SHMT gene (GenBank accession number 
JQ714083) and the same region from the Essex SHMT gene (GenBank accession 
number JQ714084) were amplified by PCR from the BAC clone 100B10 and Essex 
gDNA, respectively, and cloned into the pYXT1 vector’ to generate trans- 
criptional fusions with the B-glucuronidase (GUS) gene. Soybean hairy roots 
transformed with these constructs were generated and infected with SCN. At 2 
and 4 days after inoculation, root pieces excised from the infection zone were 
stained for GUS activity**. Multiple roots from at least five independent lines were 
stained for each construct. Root pieces were fixed with 4% v/v paraformaldehyde 
in phosphate-buffered saline overnight at room temperature, embedded in paraffin, 
and sectioned longitudinally to a thickness of 10 um. The sections were observed 
using differential interference contrast microscopy on a Vanox (Olympus) micro- 
scope and photographed with a CMOS colour digital camera. 

Genomic complementation experiments. The 5,103-bp Forrest SHMT genomic 
DNA fragment was subcloned into the Sbf1 restriction site of the pAKK binary 
vector, which has a GFP selection for transgenic events. Transgenic hairy roots 
were produced and infected with SCN as described for RNAi experiments. The 
SCN-susceptible RIL ExF63 was used for the complementation experiment. 
Control hairy roots were produced by transforming ExF63 and ExF67 hairy roots 
with the pAKK binary vector carrying only the SHMT promoter sequence. The 
experiment was conducted independently five times with a minimum of fifteen 
independent hairy root lines per treatment. The results were plotted and analysed 
for statistical significance by an unpaired t-test using GraphPad PRISM software. 
RNA isolation and qPCR analysis. Total RNA was isolated from root tissues 
using the RNeasy plant mini kit (Qiagen) according to the manufacturer’s instruc- 
tions. Real-time qRT-PCR was conducted as described’*. Samples were normalized 
relative to soybean ubiquitin and calibrated to the expression in the BPMV control 
sample. 

Computational methods. A two-pronged computational approach was per- 
formed to annotate structurally and functionally the identified Shmt protein 
and to estimate the effects of the mutations on Shmt function. First, a homology 
model of Shmt was obtained using Essex sequence as a target. Second, the func- 
tional sites were mapped onto the surface of Shmt using the structural information 
of ligand binding by Shmt homologues. The homology analysis of Shmt deter- 
mined 43 structurally resolved Shmt homologues from a diverse set of bacterial 
and mammalian species; no structurally resolved plant Shmt proteins were found. 
Among the group of four homologues with the highest sequence similarities, the 
mouse SHMT (Protein Data Bank accession 1EJI) with the largest coverage of the 
Shmt sequence (sequence identity 57%, template coverage 100%) was selected as a 
template for homology modelling of Shmt. Homology modelling was done using 
MODELLER-9” and the top-ranked model was selected from the set of candidate 
models using the MODELLER scoring function. To determine the ligand binding 
sites for pyridoxal 5’-phosphate (PLP)-serine (PLS), PLP-glycine (PLG), and THF/ 
MTHE/FTHE, the obtained model of Shmt was structurally aligned with each of the 
orthologous Shmt proteins known to interact with the small ligands, and the ligand 
binding site from each homologue was mapped on to the surface of the Shmt 
model through the structural alignment (note that the contributing parts of the 
ligand binding sites are not identical between the two monomers). The residues 
constituting the glycine binding sites, GBS1 and GBS2, in the Shmt homologues 
were identified in the literature and then mapped onto the structure of Shmt in a 
similar way by using the structural alignment of SHMT with its homologue. 

E. coli complementation experiment. An E. coli GS245 glyA strain’? was 
obtained from the E. coli Genetic Stock Center. The GS245 strain was lysogenized 
with ADE3 phage (Novagen) according to manufacturer’s protocols and trans- 
formed with pLysS plasmid (Novagen) to generate GS245(DE3)pLysS for com- 
plementation analysis'®. The cDNA sequences corresponding to Forrest, Essex, 
F6266 and F6756 Shmt proteins were subcloned at NdeI and HindIII sites of 
pET28a vector (Novagen) for expression of the recombinant proteins with an 
N-terminal 6X-histidine tag. GS245(DE3)pLysS strains carrying the plasmids 
were grown in LB medium overnight. The A¢oo value of each culture was measured 
and an equal absorbance of cells was collected and washed twice in M9 minimal 


medium (1X M9 salts (Sigma), 10% w/v glucose, 50 mg ml’ phenylalanine, and 
10mgml * of thiamine) and finally re-suspended in two flasks of M9 minimal 
medium with required antibiotics for selection. One of the flasks was supplemented 
with IPTG at 0.25 mM final concentration. The cultures were grown in a shaker 
incubator (New Brunswick Scientific) at 37 °C and 180 r-p.m. setting. Agog values of 
cultures were measured at 0 h and 6h and thereafter every 3h up to 24h. The Agoo 
measurements were plotted versus time of incubation to obtain growth curves. 
Shmt purification and kinetic analysis. Shmt proteins were purified from GS245 
cultures grown at 30°C for 4h after induction with 0.5 mM IPTG at Agoo of 0.5. 
His-tagged proteins were purified using His Pur Cobalt Resin (Thermo Scientific) 
according to manufacturer’s instructions. Purified proteins were dialysed against 
three changes of 50 mM potassium phosphate buffer, pH 7.4, containing 10% v/v 
glycerol, 10 * M tris(hydroxypropyl)phosphine, and 10° M pyridoxal L-phosphate. 
Protein concentration was estimated by the Bradford method. Shmt assays were 
performed as described previously”. Briefly, 50 ul reactions comprising 2mM 
serine, 0.2-4mM _ (6R,S)-H4PteGlu (Tetrahydrofolate, THF) (Schircks Lab), 
4mM tris(hydroxypropyl)phosphine, 0.25mM PLP, and 6,1g of enzyme in 
50 mM potassium phosphate buffer (pH 7.4) were incubated at 30 °C for 20 min. 
The reaction was stopped by adding 25 1] of 0.1 M dithiothreitol (DTT) and 50 pl 
of 0.1 M NaBHg. Each reaction was further incubated for 15 min at 37 °C to drive 
the reduction of 5,10-methylene tetrahydrofolate (MTHF) to 5-methyltetrahydro- 
folate to completion. The samples were then boiled for 3 min and centrifuged at 
20,000 g to remove precipitated protein. The supernatant from the centrifugation 
step was supplemented with 25 11 0.6 M DTT to prevent product decomposition. 
A no serine control reaction was performed simultaneously. A 1:10 dilution of 
this sample was used for HPLC separation and quantification. HPLC injection 
volume was typically 5 pl. 5-methyltetrahydrofolate in the final reaction mixture 
was detected and quantified by HPLC coupled with a fluorescent detector set at 
289 nm excitation and 359 nm emission wavelengths (Shimadzu Corp). A Restek 
Pinnacle II C18 column (5 tm, 150 X 4.6mm, Restek US) was used for HPLC 
separation. The product separation was done isocratically with a mobile phase 
consisting of 6% v/v acetonitrile in 30 mM phosphate buffer pH 3.0 (adjusted with 
phosphoric acid) at 0.7 ml min '. The level of 5-methyltetrahydrofolate detected 
was quantified by comparison with standards. For this, a standard calibration 
curve was created with known amounts of 5,10-methylene tetrahydrofolate 
reduced to 5-methyltetrahydrofolate using 0.1 M NaBH, and 0.1 M DTT and from 
peak areas of 5-methyltetrahydrofolate eluted. The specific activity of the enzyme 
at varying concentrations of THF was plotted against THF concentrations to 
obtain the kinetic curve. Kinetic analysis was repeated three times for each of 
the enzymes using independent enzyme preparations. 
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An early age increase in vacuolar pH limits 
mitochondrial function and lifespan in yeast 


Adam L. Hughes! & Daniel E. Gottschling' 


Mitochondria have a central role in ageing. They are considered to 
be both a target of the ageing process and a contributor to it’. Altera- 
tions in mitochondrial structure and function are evident during 
ageing in most eukaryotes’, but how this occurs is poorly under- 
stood. Here we identify a functional link between the lysosome-like 
vacuole and mitochondria in Saccharomyces cerevisiae, and show 
that mitochondrial dysfunction in replicatively aged yeast arises 
from altered vacuolar pH. We found that vacuolar acidity declines 
during the early asymmetric divisions of a mother cell, and that 
preventing this decline suppresses mitochondrial dysfunction and 
extends lifespan. Surprisingly, changes in vacuolar pH do not limit 
mitochondrial function by disrupting vacuolar protein degrada- 
tion, but rather by reducing pH-dependent amino acid storage in 
the vacuolar lumen. We also found that calorie restriction pro- 
motes lifespan extension at least in part by increasing vacuolar 
acidity via conserved nutrient-sensing pathways’. Interestingly, 
although vacuolar acidity is reduced in aged mother cells, acidic 
vacuoles are regenerated in newborn daughters, coinciding with 
daughter cells having a renewed lifespan potential*. Overall, our 
results identify vacuolar pH as a critical regulator of ageing and 
mitochondrial function, and outline a potentially conserved mecha- 
nism by which calorie restriction delays the ageing process. Because 
the functions of the vacuole are highly conserved throughout evolu- 
tion®, we propose that lysosomal pH may modulate mitochondrial 
function and lifespan in other eukaryotic cells. 

We sought to understand the basis of age-induced mitochondrial 
dysfunction, using the budding yeast S. cerevisiae. Ageing in yeast 
(replicative ageing) is defined as the number of times an individual 
cell divides’, and aged yeast cells exhibit many of the characteristics of 
age-induced mitochondrial dysfunction observed in metazoa’. These 
include mitochondrial fragmentation, increased levels of mitochondrial 
reactive oxygen species, and increased loss of mitochondrial DNA in 
their progeny*°. We began our inquiry by further characterizing age- 
associated mitochondrial phenotypes using a genetic system in yeast 
that enriches for replicatively aged mother cells, the mother enrichment 
program (MEP)"°. By monitoring an outer mitochondrial membrane 
protein, Tom70, tagged with green fluorescent protein (Tom70-GFP), 
we found that mitochondrial dysfunction was prevalent in aged yeast, 
consistent with results from earlier studies®. Mitochondria—which are 
normally tubular—fragmented and ultimately aggregated in aged cells 
(Fig. 1a). Mitochondrial fragmentation was present early in the ageing 
process (86% of cells at 8 divisions), and progressed to large aggre- 
gates and small fragments (93% of cells at 17 divisions) that persisted 
throughout ageing (median lifespan of 28 divisions). We observed 
similar changes using an inner mitochondrial membrane protein, 
Tim50-GFP (Supplementary Fig. 2a). Mitochondrial function also 
declined in aged cells. We measured the mitochondrial membrane 
potential (A) of aged cells using 3,3’ -dihexyloxacarbocyanine iodide 
(DiOCg,), a A Y-dependent mitochondrial fluorescent dye"’, and found 
that 80% of cells had reduced mitochondrial DiOC, staining by 7 
divisions (Supplementary Fig. 2b). At 18 divisions, AY” was even lower 
and remained low thereafter (100% of cells by 28 divisions). Consistent 


with this, A ¥-dependent import of the mitochondrial matrix chimaeric 
protein preCox4—mCherry’ was also reduced in aged cells (Supplemen- 
tary Fig. 2c). Collectively, these results indicate that mitochondrial struc- 
ture and function are progressively altered during the ageing process. 
Mitochondrial structure is regulated by 250 identified genes in 
young cells'*!°. We considered whether age-induced mitochondrial 
dysfunction might be the result of the altered function of processes 
associated with one or more of these gene products. To explore this 
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Figure 1 | Age-induced mitochondrial dysfunction is suppressed by VMA1 
overexpression. a, b, Age (divisions) is shown at the bottom of all panels, and 
represents the exact age for young cells (0-5) and the median age for older cells, 
as determined by calcofluor staining of bud scars. n = 30 for all time points. A 
representative image is shown for each time point. M, mother cell. DIC, 
differential interference contrast. a, Mitochondrial morphology in aged cells 
expressing the mitochondrial outer membrane protein Tom70-GFP. 

b, Mitochondrial membrane potential as indicated by DiOCg staining of aged 
Tom70-mCherry wild type (WT) and VMA1-overexpressing (OE) cells 
incubated with or without 500 nM concA for 2h. Note that DiOC, stains the 
plasma membrane of aged cells. 
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idea, we overexpressed each of the 250 genes in a MEP strain expres- 
sing Tom70-GFP, and screened for genes whose overexpression 
delayed the onset of altered mitochondrial morphology in aged cells. 
Two suppressors identified in our screen, VMAI and VPH2, are 
required for activity of the vacuolar H*-ATPase (V-ATPase). The 
V-ATPase is an evolutionarily conserved protein complex that acidifies 
the vacuole (the lysosome in metazoa)°. Vmal is the catalytic subunit of 
the peripheral-membrane-associated V, sector of the V-ATPase™. 
Vph2 is an endoplasmic-reticulum-localized integral membrane 
protein required for V-ATPase assembly’. Overexpression of VMA1 
or VPH2 from a high-copy promoter suppressed age-induced mito- 
chondrial dysfunction in 93% and 77% of aged cells, respectively (17 or 
16 divisions) (Fig. 1b and Supplementary Fig. 3a). Importantly, sup- 
pression by VMAI or VPH2 overexpression required V-ATPase 
activity, as treatment with the specific V-ATPase inhibitor concana- 
mycin A (concA)'® blocked their ability to produce normal, tubular 
mitochondria (Fig. 1b and Supplementary Fig. 3a). 

Because overexpressing proteins required for vacuolar acidity pre- 
vented age-induced mitochondrial dysfunction, we tested whether 
vacuolar pH itself was changed in aged cells. Measurement of vacuolar 
acidity with four different pH-sensitive reporters revealed that the 
acidity of the vacuole decreased as newborn cells became aged mother 
cells (Fig. 2a, b and Supplementary Fig. 4a—c). Vacuoles in over 90% of 
newborn cells were acidic and were stained efficiently with the fluor- 
escent dye quinacrine, which accumulates in acidic environments’” 
(Fig. 2a and Supplementary Fig. 4a). However, 87% of cells that had 
produced daughters had decreased vacuolar acidity, as indicated by 
little or no quinacrine staining (Fig. 2a and Supplementary Fig. 4a). 
Kinetic analysis of vacuolar acidity decline using Pho8-SEP/mCherry 
(a dual fluorescent protein reporter system in which one copy of 
vacuolar Pho8 is fused to pH-sensitive super ecliptic pHluorin (SEP, 
which increases in fluorescence with increasing pH)'* and the other 
to pH-insensitive mCherry) revealed that vacuolar acidity declined 
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progressively during approximately the first four mother-cell divisions 
and then remained low for at least 18 divisions (Fig. 2b). However, 
acidity did not decline to the level observed in cells lacking V-ATPase 
function (Fig. 2b), suggesting that the aged cells had not completely 
lost vacuolar acidity. 

Because vacuolar acidity decreases before mitochondrial dysfunc- 
tion in ageing cells, we wondered whether overexpressing VMAI or 
VPH2 suppressed mitochondrial dysfunction by preventing vacuolar 
acidity decline. Indeed, overexpressing VMA1 or VPH2 increased vacu- 
olar acidity in =70% of aged mother cells (~18 divisions) (Fig. 2a and 
Supplementary Fig. 3b). Although it is unclear how VMAI1 over- 
expression increased vacuolar acidity, overexpressing VPH2 increased 
the levels of assembled V-ATPase at the vacuolar membrane (Sup- 
plementary Fig. 5). 

These results suggest that reduced vacuolar acidity in mother cells 
leads to the subsequent development of mitochondrial dysfunction. 
Consistent with this idea, young cells lacking the V-ATPase subunit 
VMA2 had reduced AY and mitochondrial morphology similar to 
aged cells (Supplementary Fig. 6a). Additionally, treatment of young 
cells with the V-ATPase inhibitor concA caused mitochondrial depo- 
larization just 30min after loss of vacuolar acidity (Fig. 2c and 
Supplementary Fig. 6b). This loss of AY was followed by mitochon- 
drial fragmentation and aggregation that resembled mitochondrial 
phenotypes present in aged cells. Collectively, these results indicate 
that vacuolar and mitochondrial function are intimately linked, and 
suggest that low levels of vacuolar acidity in mother cells cause mito- 
chondrial dysfunction by inducing mitochondrial depolarization that 
is followed by fragmentation and aggregation. 

Because mitochondrial function is a critical regulator of lifespan’”, 
we suspected that vacuolar pH would be as well. Consistent with this 
idea, yeast cells lacking V-ATPase activity have drastically shortened 
lifespans*®. We measured the replicative lifespan of wild-type, VMA1- 
and VPH2-overexpressing strains and found that overexpressing 
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Figure 2 | Vacuolar acidity is reduced in ageing cells and regulates 
mitochondrial function and lifespan. a—c, Notation is as for Fig. 1a, b. 

a, b, Vacuolar acidity as indicated by quinacrine (quin; a) staining of aged 
Vphi-mCherry wild-type (WT) and VMA1-overexpressing (OE) cells, and the 
vacuolar pH reporter Pho8-SEP in cells expressing Pho8-mCherry incubated 


262 | NATURE | VOL 492 | 13 DECEMBER 2012 


with or without 500 nM concA for 15 min (b). c, DiOC, staining of young cells 
expressing Tom70-mCherry treated with 500 nM concA for the indicated time. 
d, Replicative lifespan of wild-type and VMA1-overexpressing cells by 
micromanipulation. Median lifespan is indicated. P = 0.0002, Wilcoxon rank- 
sum test. n = 50 for wild-type and 56 for VMA1-overexpressing cells. 
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either gene significantly increased both median and maximal replica- 
tive lifespan (Fig. 2d and Supplementary Fig. 3c). These results suggest 
that vacuolar pH is indeed an important determinant of yeast lifespan. 

We then asked how vacuolar pH affects mitochondrial function. 
The vacuolar proton gradient generated by the V-ATPase is required 
for efficient protein degradation, and for transport of ions and basic 
and neutral amino acids into the vacuole for storage’. Blocking vacuo- 
lar protein degradation in a pH-independent manner by deleting the 
vacuolar protease PEP4 (ref. 21) did not alter mitochondrial struc- 
ture or AY (Supplementary Fig. 7). Therefore, we examined whether 
reduced import of vacuolar metabolites was responsible for mitochon- 
drial dysfunction under conditions of decreased vacuolar acidity. If 
reduced import of a particular amino acid or ion into the vacuole led 
to mitochondrial dysfunction, then overexpressing the transporter 
required for import of that metabolite into the vacuole might suppress 
mitochondrial dysfunction in cells with a low vacuolar proton gra- 
dient. To test this, we reduced the vacuolar proton gradient by treating 
cells with low concentrations of concA, and measured mitochondrial 
AY’ with DiOC, in cells overexpressing individual vacuolar amino acid 
or ion transporters. Overexpressing most characterized V-ATPase- 
dependent vacuolar importers*”** had no (NHX1, VBA1, VBA3) or 
little (VCX1, VBA2) effect on mitochondrial function. However, over- 
expressing AVT1, a neutral amino acid transporter’’, prevented mito- 
chondrial dysfunction in 60% of cells (Supplementary Fig. 8a, b). This 
suggests that disrupting proton-dependent neutral amino acid storage 
in the vacuole causes mitochondrial dysfunction. 

Wealso found that AVT1 was critical in connecting the vacuole and 
mitochondria during ageing. Overexpressing AVTI prevented altera- 
tions in mitochondrial structure and AY in 63% of aged cells (19 
divisions; Fig. 3a), even though vacuolar acidity was reduced in these 
cells (17 divisions; Supplementary Fig. 9a). Conversely, deleting AVT1 
accelerated the development of age-induced mitochondrial dysfunc- 
tion (Fig. 3b) without affecting the kinetics of vacuolar acidity decline 
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(Supplementary Fig. 9b). Deleting AVT1 also prevented the suppres- 
sion of mitochondrial dysfunction by VMA1 and VPH2 overexpres- 
sion without affecting vacuolar acidity (Fig. 3c and Supplementary 
Fig. 3d). Collectively, these results suggest that reduced vacuolar acidity 
in aged cells might cause mitochondrial dysfunction by preventing 
adequate import of neutral amino acids through the vacuolar trans- 
porter Avtl. Importantly, overexpressing or deleting AVT1 was suffi- 
cient to extend or shorten replicative lifespan, respectively (Fig. 3d). 
This suggests that changes in vacuolar pH modulate lifespan at least in 
part through regulation of mitochondrial function. The differences in 
mitochondrial function and lifespan by AVT1 overexpression or dele- 
tion were not as marked as when vacuolar pH was increased or decreased 
(compare Fig. 3 to Figs 1 and 2), suggesting that substrates of additional 
V-ATPase-dependent transporters may contribute to age-induced 
mitochondrial dysfunction caused by reduced vacuolar acidity. 

Calorie restriction extends lifespan in most eukaryotes through 
conserved nutrient-sensing pathways’. In yeast, calorie restriction is 
thought to function through at least three nutrient-sensing pathways/ 
kinases: the cAMP-dependent protein kinase A (PKA) pathway, the 
target of rapamycin (TOR) pathway, and the putative S6 kinase/AKT 
homologue Sch9 (refs 20, 26). Reduced activity of any of these kinases 
mimics calorie restriction and extends lifespan. We found that growth 
in calorie-restricted carbon sources (0.5% glucose, 0.05% glucose and 
3% glycerol)”*”’ increased vacuolar acidity in young cells (Supplemen- 
tary Fig. 10), and prevented the decline of vacuolar acidity in ageing 
cells (Fig. 4a). Calorie restriction also suppressed mitochondrial dys- 
function in 90% of aged cells (21 divisions) in a V-ATPase-dependent 
manner (Fig. 4b). 

Calorie restriction seems to regulate vacuolar acidity through con- 
served nutrient-sensing pathways because inhibiting PKA, Sch9 or 
TOR by the lifespan-extending mutations gpa2A, sch9A or tor1A, also 
prevented the decline of vacuolar acidity and development of mito- 
chondrial dysfunction in ageing cells (Supplementary Fig. 11a, b)*°”*. 
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Figure 3 | Reduced vacuolar acidity causes mitochondrial dysfunction by 
disrupting amino acid homeostasis. a—c, Notation is as for Fig. 1a, b. 

a, b, DiOC; staining of aged Tom70-mCherry wild-type (WT), AVTI- 
overexpressing (OE) (a) and avt1A cells (b). c, Quinacrine (quin) and DiOC, 
staining of aged wild-type and avt14J cells overexpressing VMA1. d, Replicative 


No. of divisions 


lifespan of wild-type, avt14 and AVT1-overexpressing cells by 
micromanipulation. Median lifespan is indicated. P = 0.0376 (wild-type versus 
AVT1-overexpressing cells), P = 0.0045 (wild type versus avt14), Wilcoxon 
rank-sum test. n = 32 for wild type, 32 for AVT1-overexpressing, and 31 for 
avtlA. 
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Figure 4 | Calorie restriction extends lifespan by regulating vacuolar 
acidity. a, b, Notation is as for Fig. 1a, b. Quinacrine (a) or DiOC, (b) staining 
of cells expressing Vphl-mCherry (a) or Tom70-mCherry (b) aged in 2% or 
0.5% glucose with or without 500 nM concA for the final 2 h of ageing. 

c, d, Replicative lifespan of wild-type (WT), vma24 (c) and VMAI- 
overexpressing (OE) cells (d) grown with or without calorie restriction (CR; 
0.5% glucose). Median lifespan is indicated. c, P< 0.0001 (wild type versus 
wild-type calorie restricted), P = 0.0398 (vma2A versus vma2A calorie 
restricted), P< 0.0001 (wild type versus vma2A), Wilcoxon rank-sum test. 


Consistent with this, constitutively activating the PKA pathway by 
deleting the Ras GTPase-activating protein IRA2 (ref. 28), reduced 
vacuolar acidity and accelerated the development of mitochondrial 
dysfunction in ageing cells (Supplementary Fig. 11lc, d). Deleting 
IRA2 also prevented calorie-restriction-mediated enhancement of 
vacuolar acidity and suppression of mitochondrial dysfunction (Sup- 
plementary Fig. 11c, d). Furthermore, we found that lifespan extension 
by calorie restriction was prevented in a strain lacking V-ATPase 
activity (Fig. 4c) and, conversely, calorie restriction did not further 
extend the lifespan of a strain overexpressing VMAI1 (Fig. 4d). 
Collectively, these results indicate that vacuolar acidity is a target of 
calorie-restriction-regulated nutrient sensing pathways, and suggest 
that calorie restriction suppresses age-induced mitochondrial dysfunc- 
tion and extends lifespan at least in part by preventing the decline in 
vacuolar acidity in ageing cells. 

In budding yeast, cytokinesis is asymmetric with respect to ageing. 
Although mother cells age, they produce daughter cells with full lifespan 
potential, suggesting that the cellular events that cause ageing are reset 
in daughter cells*. During our analysis, we discovered a marked distinc- 
tion in vacuolar pH between mother and daughter cells. Although 
vacuolar acidity was reduced in mother cells, high acidity was restored 
in newborn cells, even in daughters of old mothers (Fig. 2a, b and 
Supplementary Fig. 4a—c). This asymmetry required V-ATPase activity, 
as it was eliminated in vma2A cells, which lack V-ATPase activity and 
have constitutively reduced vacuolar acidity (Supplementary Fig. 4d). 
Taking all our results together, the acidity of the vacuole seems to be 
critical for resetting lifespan in new yeast cells, and represents the 
earliest ‘rejuvenation’ event observed in this organism. 

Overall, our results support a model (Supplementary Fig. 1) in 
which a progressive decline in vacuolar acidity during early mother- 
cell divisions limits mitochondrial function and lifespan, and provide a 
potential mechanism by which amino acid and glucose limitation may 
converge to regulate lifespan. Somewhat surprisingly, our data indicate 
that at least in yeast, age-induced mitochondrial dysfunction is not the 
result of a block in vacuolar protein degradation, but is rather due to 
decreased uptake of neutral amino acids into the vacuolar lumen. 
How an apparent reduction in neutral amino acid storage leads to a 
decline in mitochondrial membrane potential remains to be deter- 
mined, but we speculate that mitochondria-dependent catabolism of 
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for vma2A calorie restricted. d, P< 0.0001 (wild type versus wild-type calorie 
restricted), P< 0.0001 (wild type versus VMA1-overexpressing), P = 0.0023 
(wild type versus VMA1-overexpressing calorie restricted), P = 0.52 (wild-type 
calorie restricted versus VMA1-overexpressing), P = 0.076 (VMAI- 
overexpressing versus VMA 1-overexpressing calorie restricted), Wilcoxon 
rank-sum test. n = 38 for wild type, 40 for wild-type calorie restricted, 36 for 
VMAI-overexpressing, and 38 for VMA1-overexpressing calorie restricted. 


excess cytoplasmic amino acids (or their derivatives) places a large 
burden on proton-dependent mitochondrial carrier proteins” and, 
in the process, overwhelms mitochondrial AY. Other metabolites 
may also contribute to this vacuolar-mitochondrial cross-talk, and 
the nature of relevant metabolite(s) will be influenced by the cellular 
growth environment. Given that V-ATPase-coupled amino acid/ion 
import’ and lifespan extension by reduction of amino acids in food*® 
are conserved among many organisms, we postulate that our findings 
in yeast might be applicable to other eukaryotes. 


METHODS SUMMARY 


A detailed description of all experimental methods including strains, plasmids, cell 
culture, MEP purification, fluorescent staining, microscopy, lifespan measurement, 
protein preparation, western blotting and the mitochondrial screen is provided 
in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Strains. Yeast strains are listed in Supplementary Table 1. All strains are deriva- 
tives of S. cerevisiae S288c (BY) unless otherwise indicated. One-step PCR- 
mediated gene replacement and epitope tagging were carried out using standard 
techniques, the previously described template plasmids pRS306, pRS400 KanMX, 
pKT127 and pBS3é4 (refs 31, 32), and the newly created template plasmid pKT127- 
SEP. pBS34 was obtained from the Yeast Resource Center at the University of 
Washington with permission from R. Tsien’. Oligonucleotides for gene replace- 
ment and tagging are listed in Supplementary Table 2. Strains expressing empty 
vector, VMA1, VPH2 or AVT1 from a GPD promoter integrated into an empty 
region of chromosome 1 (199456-199457) were constructed by transformation of 
parental yeast strains with NotI-digested pAG306-GPD-empty chr 1, pAG306- 
GPD-VMAI chr 1, pAG306-GPD-VPH2 chr 1 or pAG306-GPD-AVT1 chr 1, 
respectively. Strains expressing Vma2-GFP were derived from the yeast GFP 
collection**. 

Plasmids. pAG306-GPD-empty chr 1, pAG306-GPD-VMAI chr 1, pAG306- 
GPD-VPH2 chr 1 and pAG306-GPD-AVT1 chr 1 were generated in two steps. 
First, we created pAG306-GPD-ccdB chr 1, a plasmid for high expression of genes 
from the GPD promoter that can be integrated into chromosome 1 (199456- 
199457) after NotI digestion. We generated pAG306-GPD-ccdB chr 1 by ligation 
of a SmalI-digested fusion PCR product that contained two ~500-base-pair regions 
of chromosome 1 flanking a Not! site into AatII-digested pAG306-GPD-ccdB 
(Addgene plasmid 14140)”. We generated the fusion PCR product using oligo- 
nucleotides Chr PartB Smal F and ChrlI PartA Smal R to amplify two templates 
generated by PCR of yeast genomic DNA using oligonucleotide pairs Chrl PartA 
NotI F and ChrI PartA Smal R, and ChrI PartB Smal F and ChrI PartB Not! R, 
respectively. Second, we inserted VMA1, VPH2 and AVT1 into pAG306-GPD- 
ccdB chr 1 from donor Gateway plasmids pDONR221-VMA1 (Harvard Institute 
of Proteomics (HIP) accession ScCD00023712), pDONR221-VPH2 (HIP acces- 
sion ScCD00012560) and pDONR221-AVT1 (HIP accession ScCD00012825) 
using LR clonase according to the manufacturer’s instructions (Invitrogen)”*. 
We created pAG306-GPD-empty chr 1 by digestion of pAG306-GPD-ccdB chr 
1 with HindIII and Spel to remove the ccdB gene, followed by blunt-ended religa- 
tion of the plasmid after filling in overhangs with T4 DNA polymerase (New 
England Biolabs). 

pKT127-SEP, used for creating in-frame C-terminal chromosomal fusions of a 
gene of interest with SEP, was created by removing GFP from pKT127 by restric- 
tion digestion with Pacl/Ascl and replacing it with Pacl/Ascl-digested SEP gen- 
erated by PCR of template plasmid pGM87 (ref. 18) using oligonucleotides SEP 
Pacl F and SEP AsclI R. 

Media and cell culture. Cells were grown exponentially for 15h to a maximum 
density of 5 X 10°cellsml~' before the initiation of all experiments. Because 
nutrients and growth phase affected vacuolar and mitochondrial function, 
extended log-phase growth was necessary to ensure vacuolar and mitochondrial 
uniformity across the cell population before the initiation of all experiments. Cells 
were cultured in YEPD (1% yeast extract, 2% peptone, 2% glucose) unless other- 
wise indicated. For calorie restriction experiments, cells were cultured in YEP plus 
the indicated carbon source. Yeast Complete (YC) medium used in the high-copy 
suppressor screen was previously described*’. ConcA (Sigma) was added to cul- 
tures at a final concentration of 250 or 500 nM as indicated in figure legends. 
Culturing and purification of aged MEP cells. MEP cells were aged and purified 
for downstream analysis as previously described with slight modifications'®”*. 
Briefly, 2.5 X 10’ cells from a 15h YEPD log-phase culture were washed twice 
in PBS, pH 7.4, and resuspended in PBS containing 3 mg ml”! Sulfa-NHS-LC- 
Biotin (Pierce) at a final concentration of 2.5 X 10’ cells ml” !. Cells were incu- 
bated for 30 min at room temperature (23 °C) and then washed twice in PBS and 
once in YEPD. Biotinylated cells were resuspended in 10 ml YEPD at 2.5 x 10° 
cells ml ' and recovered with shaking for 2h at 30°C. These cells were used to 
seed cultures at a density of 2X 10* biotinylated cells per ml in YEPD. 
B-Oestradiol (Sigma) was added to cultures at a final concentration of 1 [1M to 
initiate the MEP and cells were cultured at 30 °C for an appropriate time to obtain 
cells of a desired age (~1h for median age 2, 12 h for age 8, 24h for age 16, 48 h for 
age 28). Cell densities never exceeded 4 X 10° cells ml” ' and cells were washed and 
resuspended in fresh media every 24h (12 h for calorie restriction experiments) to 
prevent nutrient limitation. At each ageing time point, 1 X 10° total cells were 
harvested for purification, staining, and microscopy analysis. 

For live-cell purification, cells were washed twice in PBS, resuspended at a 
density of 2 x 10° cells ml! in 500 ul of PBS, and incubated for 30 min at room 
temperature with 25,1 streptavidin-coated magnetic beads (MicroMACS, 
Miltenyi Biotec). After incubation, cells were washed twice in PBS, resuspended 
in 8 ml of PBS, and loaded onto an LS MACS column (Miltenyi Biotec) equili- 
brated with 5 ml of PBS. After gravity flow-through of unlabelled cells and debris, 
columns were washed twice with 8 ml of PBS. Columns were then removed from 


the magnetic field and aged cells were eluted by gravity flow with 8 ml of PBS. For 
cell-staining experiments, purified cells were pelleted, resuspended in YEP con- 
taining 2% or 0.5% glucose at 2 X 10°cells ml‘, and recovered for 1h at 30°C 
before staining. 

For purification of cells after fixation, cells were washed once in PBS and resus- 
pended at 2 X 10® cells ml’ and fixed in 500 ull of 4% paraformaldehyde in H,O 
for 15 min. Cells were then washed twice in PBS before incubation with strepta- 
vidin beads at 4 °C instead of at room temperature. Additionally, 2 mM EDTA was 
added to PBS in all fixed-cell purification steps to prevent cell clumping. 

Labelling and purification of aged non-MEP cells (Supplementary Fig. 2c) was 

conducted exactly as described for MEP cells except that labelled mother cells were 
purified after 12 h of log-phase growth, reseeded, and purified a second time after 
another 12h of log-phase growth to obtain 18-division-old mother cells. Cells 
never exceeded a density of 7 X 10° cells ml. 
Fluorescent staining and microscopy. Quinacrine (Sigma) staining was per- 
formed as previously described*’. Briefly, 2 < 10° log-phase or purified aged cells 
were washed once in YEPD plus 100 mM HEPES, pH 7.6, and resuspended in 
100 ll of the same buffered media containing 200 [1M quinacrine. Cells were incu- 
bated for 10 min at 30°C and then 5 min on ice. Cells were pelleted and washed 
twice with ice-cold 100mM HEPES, pH 7.6, plus 2% glucose. Cells were resus- 
pended in 100 mM HEPES, pH 7.6, plus 2% glucose for imaging. Before imaging, 
cells were kept on ice and all images were obtained within 30 min of staining. 

For staining with 5-(and-6)-carboxy-2',7'-dichlorofluorescein diacetate’’ 
(CDCFDA) (Invitrogen) or 2',7'-bis (carboxyethyl)-5(6)-carboxyfluorescein” 
(BCECF-AM) (Invitrogen), 2 x 10° log-phase or purified aged cells were washed 
once in YEPD plus 100 mM HEPES, pH 7.6, and resuspended in 100 il of the same 
buffered media containing 10 4M CDCFDA or 50 uM BCECE. Cells were incu- 
bated for 30min at 30°C and then washed twice at room temperature with 
100mM HEPES, pH 7.6, plus 2% glucose. Cells were resuspended in 100 mM 
HEPES, pH 7.6, plus 2% glucose for imaging. 

DiOC, (Invitrogen) staining was carried out according to manufacturer’s 
instructions. Briefly, 2 x 10° log-phase or purified aged cells were washed once 
in 10mM HEPES, pH 7.6, plus 5% glucose and resuspended in 1 ml of the same 
buffer containing 175 nM DiOC,. Cells were incubated for 15 min at room tem- 
perature and then washed twice with 10 mM HEPES, pH 7.6, plus 5% glucose. 
Cells were resuspended in 10 mM HEPES, pH 7.6, plus 5% glucose for imaging. 
Pho8-SEP imaging was also carried out in DiOC, imaging buffer after incubation 
of cells for 20 min in the buffer. In all live-cell experiments, calcofluor (Sigma) 
staining of bud scars for age determination was carried out by including 5 1g ml’ 
calcofluor in the first post-staining wash step before imaging. 

For fluorescence microscopy analysis, cells were visualized under 60 oil mag- 
nification using a Nikon Eclipse E800 with the appropriate filter set: UV-2E/C 
DAPI for calcofluor; FITC-HYQ for GFP, SEP, quinacrine, CDCFDA, BCECF and 
DiOC.; and G-2E/C TRITC for mCherry. Images were acquired with a CoolSNAP 
HQ’ CCD camera (Photometrics) and quantified and processed using Meta- 
morph version 7.1.1.0 imaging software. Cells that exhibited at least a fourfold 
reduction in mean fluorescence intensity, compared to young, wild-type cells, were 
scored as ‘reduced’ in all figures. 

Lifespan measurement by micromanipulation. Replicative lifespan was mea- 
sured by micromanipulation as previously described, except that cells were grown 
exponentially in YEPD for 15h instead of 3h before lifespan analysis’®. 
Membrane protein preparation. 5 X 10’ cells were resuspended in 50 ull B88 
buffer (2 mM HEPES, pH 7.2, 150 mM KOAc, 5 mM MgOAc and 250 mM sorbi- 
tol) plus protease inhibitors (leupeptin, pepstatin, PMSF and aprotinin) and lysed 
with glass beads for 10min in a Multi-Tube Vortexer (VWR) at 4°C. One- 
hundred microliters of B88 plus protease inhibitors was added to the sample 
and the lysate was centrifuged at 500g for 5 min at 4°C. The supernatant of this 
spin was centrifuged at 20,000g for 10 min at 4°C. The membrane pellet was 
resuspended in 100 ul of SDS-lysis buffer (10 mM Tris-HCl, pH 6.8, 100mM 
NaCl, 1% SDS, 1mM EDTA and 1mM EGTA) plus protease inhibitors. The 
supernatant of the 20,000g spin was used as the cytosolic extract. 
Immunoblotting. Membrane and cytosolic extracts were mixed with 0.25 
volumes of 5X SDS loading buffer (150mM Tris-HCl, pH 6.8, 15% SDS, 25% 
glycerol, 0.02% bromophenol blue, 12.5% B-mercaptoethanol) and heated to 37 °C 
for 15 min. Aliquots of each sample were resolved on 7% polyacrylamide gels and 
transferred to nitrocellulose membranes (Invitrogen). Membranes were blocked 
with 5% non-fat dry milk in PBS-T (137mM NaCl, 2.7mM KCl, 10mM 
Na,HPO,, 2mM KH>PO,, 0.05% Tween 20, pH 7.4) and then incubated with 
primary and HRP-conjugated secondary antibodies (Jackson Immunoresearch). 
Blots were washed in PBS-T after each antibody incubation. Bound peroxidase- 
conjugated antibodies were detected using the SuperSignal West Pico Chemi- 
luminescent substrate (Pierce). Primary antibodies were anti-Vma2 13D11 
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(Invitrogen), anti-Pgk1 22C5 (Invitrogen), anti-Pho8 1D3 (Invitrogen) and anti- 
mCherry Living Colours monoclonal (Clontech). 

Aged-cell mitochondrial suppressor screen. MEP cells expressing Tom70-GFP 
were individually transformed in 96-well format with 280 high-copy 2 |1m plas- 
mids from a tiled genomic DNA library*’. Each plasmid contained a unique 
sequence-verified genomic DNA fragment that expressed at least one of 250 genes 
previously determined to affect mitochondrial morphology when deleted’*’’ 
(some genes were present on more than one plasmid). To identify genes that 
suppressed age-induced changes in mitochondrial morphology, all 280 plasmid- 
expressing strains were individually cultured in 5 ml of YEPD at 30°C for 6h toa 
density of 2X 10*cellsml~! from overnight-saturated cultures of cells grown 
in YC-leucine to maintain plasmids. After 6h in YEPD, B-oestradiol was added 
to cultures at a final concentration of 14M to initiate the MEP and cells 
were cultured at 30 °C for an additional 24h to obtain mother cells with a median 
age of 16 divisions (n = 15). Cells were collected by centrifugation, fixed with 4% 
paraformaldehyde, stained with calcofluor, and imaged by fluorescence micro- 
scopy for mitochondrial morphology as described earlier. Plasmid containing 
strains in which 40% of aged (median age of 16, n = 15) cells exhibited young- 
cell-like tubular mitochondrial morphology were scored as suppressors of age- 
induced mitochondrial dysfunction. To confirm potential suppressors, each can- 
didate gene was transferred using LR clonase from pDONR221 Gateway entry 
plasmids from a previously described collection (HIP) into pAG306-GPD-ccdB 
chr 1 (ref. 36). Plasmids were integrated into chromosome 1 of a Tom70- 
GFP-expressing strain after NotI digestion and strains were aged, purified, and 
examined by fluorescence microscopy as described earlier. Positive integrated 
suppressors (such as VMA1 and VPH2) were confirmed by their ability to maintain 
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young-cell-like mitochondrial morphology in at least 70% (n = 30) of aged cells 
(median age of 17). 


31. Baker Brachmann, C. et a/. Designer deletion strains derived from Saccharomyces 
cerevisiae S288C: a useful set of strains and plasmids for PCR-mediated gene 
disruption and other applications. Yeast 14, 115-132 (1998). 

32. Sheff, M. A. & Thorn, K. S. Optimized cassettes for fluorescent protein tagging in 
Saccharomyces cerevisiae. Yeast 21, 661-670 (2004). 

33. Shaner, N.C. et al. Improved monomeric red, orange and yellow fluorescent 
proteins derived from Discosoma sp. red fluorescent protein. Nature Biotechnol. 22, 
1567-1572 (2004). 

34. Huh, W.-K. et al. Global analysis of protein localization in budding yeast. Nature 
425, 686-691 (2003). 

35. Alberti, S., Gitler, A. D. & Lindquist, S. A suite of Gateway cloning vectors for high- 
throughput genetic analysis in Saccharomyces cerevisiae. Yeast 24, 913-919 (2007). 

36. Hu, Y. et a/. Approaching a complete repository of sequence-verified protein- 
encoding clones for Saccharomyces cerevisiae. Genome Res. 17, 536-543 (2007). 

37. Van Leeuwen, F. & Gottschling, D. E. Assays for gene silencing in yeast. Methods 
Enzymol. 350, 165-186 (2002). 

38. Lindstrom, D.L., Leverich, C.K., Henderson, K.A. & Gottschling, D. E. Replicative age 
induces mitotic recombination in the ribosomal RNA gene cluster of 
Saccharomyces cerevisiae. PLoS Genet. 7, €1002015 (2011). 

39. Morano, K. A. & Klionsky, D. J. Differential effects of compartment deacidification 
on the targeting of membrane and soluble proteins to the vacuole in yeast. J. Cel! 
Sci. 107, 2813-2824 (1994). 

40. Plant, P.J., Manolson, M.F., Grinstein, S. & Demaurex, N. Alternative mechanisms of 
vacuolar acidification in H*-ATPase-deficient yeast. J. Biol. Chem. 274, 
37270-37279 (1999). 

41. Jones, G.M. etal. Asystematic library for comprehensive overexpression screens in 
Saccharomyces cerevisiae. Nature Methods 5, 239-241 (2008). 


©2012 Macmillan Publishers Limited. All rights reserved 


| sid ial Be 


doi:10.1038/nature11722 


Ubiquitin chain conformation regulates recognition 
and activity of interacting proteins 


Yu Ye!, Georg Blaser”, Mathew H. Horrocks’, Maria J. Ruedas-Rama®, Shehu Ibrahim’, Alexander A. Zhukov’, Angel Orte’®, 
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Mechanisms of protein recognition have been extensively studied 
for single-domain proteins’, but are less well characterized for 
dynamic multidomain systems. Ubiquitin chains represent a bio- 
logically important multidomain system that requires recognition 
by structurally diverse ubiquitin-interacting proteins”*. Ubiquitin 
chain conformations in isolation are often different from confor- 
mations observed in ubiquitin-interacting protein complexes, indi- 
cating either great dynamic flexibility or extensive chain remodelling 
upon binding. Using single-molecule fluorescence resonance energy 
transfer, we show that Lys 63-, Lys 48- and Met 1-linked diubiquitin 
exist in several distinct conformational states in solution. Lys 63- and 
Met 1-linked diubiquitin adopt extended ‘open’ and more compact 
‘closed’ conformations, and ubiquitin-binding domains and deubi- 
quitinases (DUBs) select pre-existing conformations. By contrast, 
Lys 48-linked diubiquitin adopts predominantly compact confor- 
mations. DUBs directly recognize existing conformations, but may 
also remodel ubiquitin chains to hydrolyse the isopeptide bond. 
Disruption of the Lys 48-diubiquitin interface changes conforma- 
tional dynamics and affects DUB activity. Hence, conformational 
equilibria in ubiquitin chains provide an additional layer of regu- 
lation in the ubiquitin system, and distinct conformations observed 
in differently linked polyubiquitin may contribute to the specificity 
of ubiquitin-interacting proteins. 

Ubiquitin is involved in most aspects of cell biology, as it serves as a 
post-translational modification of lysine (Lys) residues, regulating many 
processes including protein degradation, cell signalling, trafficking and 
the DNA damage response’. Most of these functions are mediated by 
eight structurally and functionally distinct ubiquitin chain types’, only 
two of which have been studied extensively. Lys 48-linked ubiquitin 
chains target proteins for proteasomal degradation’, whereas Lys 63- 
and Met 1-linked ubiquitin chains have multiple non-degradative roles 
in cell signalling®®. Polyubiquitin signals are decoded by ubiquitin- 
interacting proteins including ubiquitin binding domains’ and DUBs’ 
that bind or hydrolyse ubiquitin chains, respectively. Some ubiquitin- 
interacting proteins can distinguish between different linkage types, 
and trigger a specific downstream response to ubiquitination. 

The distinct cellular roles of differently linked polyubiquitin have partly 
been explained by distinct chain conformations. Crystallographic, 
NMR and small angle X-ray scattering (SAXS) studies have proposed 
‘compact’ conformations for Lys 48-linked diubiquitin* ° (Fig. 1a, b). 
In the prevalent model for Lys 48-linked diubiquitin, ubiquitin moieties 
interact via a hydrophobic patch on ubiquitin® (Fig. 1a), which is also 
the most common recognition site for ubiquitin-interacting proteins’. 
NMR relaxation and residual dipolar coupling measurements sug- 
gested that this conformation is in equilibrium with a second con- 
formation that partly exposes the hydrophobic patch®. A recent 
crystal structure suggested a third compact conformation with exposed 
hydrophobic patches on both moieties” (Fig. 1b). In contrast, Lys 63- 
and Met 1-linked diubiquitin are thought to adopt ‘open’ conforma- 
tions with no interactions between ubiquitin moieties, exposing the 


10,12,13 ( 


hydrophobic patches Fig. 1c, d), although a compact crystal struc- 
ture of Met 1-linked diubiquitin has been reported recently’* (Fig. 1d). 

Notably, the first crystal structures of ubiquitin and diubiquitin 
bound to ubiquitin-interacting proteins (reviewed in refs 2, 3, 7) indi- 
cate that in some cases known diubiquitin structures are incompatible 
with binding to ubiquitin-interacting proteins. In particular, DUBs 
must interact with the isopeptide linkage between ubiquitin moieties, 
which is not accessible in known models of Lys 48-linked polyubi- 
quitin (Fig. la, b). This suggests that compact ubiquitin chain types 
might undergo remodelling (‘opening’) to be hydrolysed by DUBs. 
Whether such chain opening is induced by DUB binding, or whether 
Lys 48-linked diubiquitin pre-exists in open conformations, is unclear. 

To understand the principles governing ubiquitin chain recognition, 
we generated Lys 48-, Lys 63- and Met 1-linked diubiquitin (termed 
K48NC, K63NC and MINC, respectively), each containing a fluorescence 
resonance energy transfer (FRET) compatible dye pair (Alexa488/ 
Alexa647, Ry = 5.6nm). Linkage-specific assembly reactions and/or 
selective purification resulted in pure dual-labelled diubiquitin (see 
Fig. le, Supplementary Fig. 1 and Supplementary Methods). Mass- 
spectrometric and enzymatic analysis, interchanging dye positions, 
and ensemble measurements of fluorescence lifetime and dye aniso- 
tropy confirmed sample quality and dye stability (Supplementary Figs 
1 and 2 and Supplementary Methods). Dye photophysics were not 
significantly altered in labelled diubiquitin (see Supplementary Fig. 2 
and Supplementary Methods). All samples showed a FRET signal in 
ensemble measurements (Fig. 1f). 

Distinct ubiquitin chain conformations that underwent FRET were 
detected at the single molecule level by excitation of the donor fluo- 
rophore, with a single laser, and monitoring emission of both donor 
and acceptor fluorophores. The resulting FRET histograms were fitted 
to Gaussian functions representing distinct populations of diubiquitin 
conformations (Fig. 2; see Supplementary Methods). To estimate the 
proportion of molecules in conformations that gave no detectable FRET 
(termed ‘non-FRET’), two-colour coincidence detection (TCCD; see 
Supplementary Methods)'* was used on the same sample. In TCCD, 
two lasers are used to excite directly the two fluorophores in the diubi- 
quitin independently, allowing direct quantification of the number of 
molecules with both donor and acceptor fluorophores. Combined use 
of TCCD and FRET measurements thus enabled estimation of the 
proportion of molecules that were in non-FRET conformations (see 
Supplementary Information). 

For K48NC, two distinct FRET populations could be resolved (Fig. 2a): 
a high-FRET population (FRET efficiency E~ 0.69) representing 
~90% of all molecules, and a low-FRET population (E ~ 0.41) 
accounting for the remaining ~10%. We were unable to detect any 
non-FRET populations in these experiments; however, we cannot rule 
out the existence ofa small population of ‘hidden’”’® non-FRET K48NC 
species which would be beyond the detection limit (see Supplementary 
Methods). Hidden protein conformations may modulate enzyme 
activity but are not easily detected"®. 
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a Lys 48-linked diUb b Lys 48-linked diUb Figure 1 | Ubiquitin chain conformations and 
ensemble FRET measurements. a-d, Distal 
aS (dark) and proximal (light) ubiquitin moieties are 
L—) shown in surface representation with hydrophobic 
patch (HP) residues (Leu 8, Ile 44, His 68, Val 70) in 
blue. N- and C-terminal Alexa dye attachment 
points are indicated. a, Lys 48-linked diubiquitin 
(diUb) derived from crystallographic and NMR 
. bat p A analysis. Protein Data Bank (PDB) accession codes 
Ox Wa 1AAR (left)* and 2PE9 (right)°®. b, Alternative 
= % * Proximal ‘compact’ Lys 48-linked diubiquitin (PDB 3AUI"). 
c, ‘Open’ Lys 63-linked diubiquitin (PDB 2JF5"’). 
d Met 1-linked diUb ¥ Alexa4838 d, Met 1-linked diubiquitin from crystallography 
¥ Alexa647 (PDB 2W9N (left)'* and 3AXC (right)"*). 

Distal e, Schematic representation of FRET-labelled 
diubiquitin assembly. See Supplementary Fig. 1 
and Supplementary Methods. f, Uncorrected 
ensemble FRET measurements for diubiquitin 
used in this study (see Supplementary Methods). 
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Similar to the results for K48NC, multiple conformations were 
also observed for K63NC and M1NC, with ~70-75% low-FRET and 
~25-30% non-FRET populations (Fig. 2b, c). Models of extended 
Lys 63- or Met 1-linked diubiquitin (Fig. 1c, d) are compatible with 
non-FRET populations, whereas the compact crystal structure of 
Met 1-linked diubiquitin’* (Fig. 1d) may represent a low-FRET 
MINC species. The prevalence of compact Lys 63-linked diubiquitin 
conformations in FRET is surprising, and has not been observed by 
other methods'*’*"*. This may be due to multiple compact and semi- 
compact conformations that collectively result in the observed low- 
FRET populations. 

To test whether the observed diubiquitin conformations are relevant 
for ubiquitin-interacting protein interaction, single-molecule measure- 
ments were performed using pM concentrations of K63NC, K48NC or 
MINC mixed with ubiquitin-interacting proteins at concentrations 
exceeding the dissociation constant (Kq) of the interaction (Supplemen- 
tary Fig. 7). With our methodology we can follow relative changes in 
the populations upon addition of ubiquitin-interacting proteins, none 
of which affected dye photophysics (Supplementary Fig. 8). Several 
ubiquitin-interacting proteins interacted with pre-existing compact 
conformations of diubiquitin. A Lys 63 linkage-specific antibody 
increased the FRET population relative to the non-FRET population 
of K63NC, in agreement with structural work!” (Fig. 3a, b). Similarly, 
the ubiquitin-binding UBAN domain of NEMO" enriched the low- 
FRET population of M1NC, consistent with UBAN binding to a com- 
pact conformation of linear diubiquitin’® (Fig. 3c, d). 

Access to the isopeptide bond is essential for DUB activity. AMSH- 
like protein (AMSH-LP; also called STAMBPL1), a Lys 63-specific 
JAMM family DUB, binds an open Lys 63-linked diubiquitin con- 
formation” (Fig. 3e). Indeed, inactivated AMSH”° (AMSHi) depleted 
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Figure 3 | Ubiquitin-interacting protein binding to Lys 63- and 

Met 1-linked chains. a—f, Structural models (coloured as in Fig. 1a with yellow 
ubiquitin-interacting protein), FRET efficiency histograms and population 
estimates are shown. All experiments are normalized to equivalent TCCD areas 
(see Supplementary Fig. 12 for Z-parameter plots). a, Lys 63-linked diubiquitin 
(PDB 2JF5"*). b, Complex of Lys 63-linked diubiquitin and Lys 63-linkage- 


268 | NATURE | VOL 492 | 13 DECEMBER 2012 


c Met1-diUb 


d NEMO UBAN 


low-FRET conformation 


the FRET and increased the non-FRET population of K63NC in 
single-molecule measurements (Fig. 3e). Structures of ubiquitin- 
specific protease (USP) DUBs with diubiquitin bound across the 
active site are unavailable, but monoubiquitin complexes show that 
the carboxy terminus of a distal ubiquitin is extended’ (Fig. 3f), sug- 
gesting that USPs also bind open conformations. Indeed, inactivated 
USP21 (ref. 21; USP21i) enriched non-FRET K63NC conforma- 
tions in our single-molecule measurements (Fig. 3f). Therefore, 
whereas the Lys 63 linkage-specific antibody or the UBAN domain 
of NEMO selects existing compact conformations of K63NC and 
MING, respectively, DUBs select pre-existing open conformations 
of K63NC (Fig. 3g). 

K48NC is in equilibrium between predominantly compact confor- 
mations (Fig. 2a). Inactivated OTUB1 (OTUB1i), a Lys 48-specific 
DUB, enriched the low-FRET and depleted the high-FRET popula- 
tion of K48NC in single-molecule measurements (Fig. 4a, b and 
Supplementary Fig. 9). This is consistent with recent crystal structures 
of OTUB1 in complex with E2 and two ubiquitin molecules” 
that revealed a relatively compact conformation of the two ubiquitin 
moieties when bound to OTUB1 (Fig. 4c). 

Notably, titration of K48NC with USP21i resulted in depletion of 
high-FRET and an increase in low-FRET populations, but also gave 
rise to a non-FRET population of K48NC (Fig. 4d, e). The appearance 
of open, non-FRET K48NC species can be rationalized structurally, as 
USP21 stretches the linkage across the active site to form a catalytically 
competent conformation (Fig. 3f). However, the increase in low-FRET 
populations indicates that USP21i binds semi-open conformations 
directly (Fig. 4d, e). Estimation of binding constants for the low- 
and non-FRET species indicated a slightly higher affinity of USP21i 
for the open non-FRET conformation (Supplementary Fig. 9). 
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specific antibody (PDB 3DVG"’). c, Met 1-linked diubiquitin (PDB 2W9N"’). 
d, Complex of met 1-linked diubiquitin and NEMO UBAN (PDB 2ZVO%, only 
one diubiquitin shown). e, Complex of Lys 63-linked diubiquitin and AMSH- 
LP (PDB 2ZNV”’). f, Complex of monomeric ubiquitin with USP21 (PDB 
2Y5B", proximal ubiquitin indicated). g, Model for Lys 63- and Met 1-linked 
diubiquitin interaction with binding partners. 
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Figure 4 | DUB interaction with Lys48-linked chains. a, FRET efficiency 
histograms of K48NC with increasing OTUB1i concentration. Estimated FRET 
efficiencies: Enigh-rret ~ 0.69, Ejow-rret ~ 0.45 (Supplementary Fig. 13 and 
Supplementary Methods). b, Relative changes of K48NC populations from 

a. Errors represent standard deviation in curve fitting. c, Structure of OTUB1 
(yellow) bound to two ubiquitin molecules (grey) (derived from PDB 


Our data result in two models for DUB interactions with Lys 48- 
linked ubiquitin chains (Fig. 4f). DUBs such as OTUB1 may directly 
recognize and hydrolyse low-FRET semi-open conformations of 
Lys 48-linked diubiquitin. Other enzymes such as USP21 recognize 
semi-open and open (that is, low-FRET and non-FRET) conforma- 
tions. Binding of semi-open conformations could lead to remodelling 
to open conformations that are compatible with catalysis (Fig. 4f). 
Alternatively, dissociation of low-FRET and re-binding of non-FRET 
diubiquitin could account for generating a catalytically competent 
enzyme-substrate complex (Fig. 4f). Both models imply that recognition 
of Lys 48 linkages is governed initially by conformational selection. 

Importantly, our data indicate that the compact, high-FRET popu- 
lation of K48NC is not recognized directly. Access to ubiquitin hydro- 
phobic patches is obstructed in the prevalent compact structure of 
Lys 48-linked diubiquitin (Fig. la and refs 8-10), which would corre- 
late with a high-FRET population. Because all known DUBs bind the 
hydrophobic patch of ubiquitin’, interconversion from compact high- 
FRET to semi-open low-FRET conformations could be crucial for DUB 
activity (Fig. 4f). To test this hypothesis, we mutated Ile 44 in the hydro- 
phobic patch of the proximal ubiquitin of K48NC (K48NC(I44E)) to 
disrupt a hydrophobic-patch-based interface. We observed a reduc- 
tion in the high- and low-FRET populations and a shift in their peak 


¥@@« Non-FRET (E < 0.1) 


4DHZ”*). d, FRET efficiency histograms of K48NC with increasing USP21i 
concentrations as in a (Epigh-rrer ~ 0-71, Ejow-rrer ~ 0.37). €, Relative changes 
of K48NC populations derived from d, shown as in b. f, Model for interaction of 
DUBs with Lys 48-linked chains. g, Initial rate analysis of USP21-mediated 
hydrolysis of K48NC (green) and K48NC(144E) (red). Errors represent 
standard deviation of triplicate measurements. 


positions to lower FRET efficiencies as compared to K48NC (Sup- 
plementary Fig. 10), indicating that the mutation changes the confor- 
mational equilibria such that open conformations, as well as alternative 
compact forms of this chain type, may now be populated (for example, 
Fig. 1b and ref. 11). Importantly, kinetic assessment of USP21- 
mediated cleavage shows that K48NC(I44E) is hydrolysed significantly 
faster than wild-type K48NC (Fig. 4g). We propose that ubiquitin 
chain conformation and dynamics directly affect the rate of DUB 
hydrolysis. 

We show here how FRET/TCCD measurements can be used to 
probe the complex conformational dynamics of ubiquitin chains in 
solution, thereby identifying novel conformations not detected using 
NMR or X-ray crystallography. Our results establish that distinct con- 
formational populations of flexible two-domain ubiquitin chains are 
recognized by ubiquitin-interacting proteins. In the case of USP21, we 
suggest that subsequent remodelling may take place to achieve active 
conformations. Such a combination of ‘conformational selection’ 
and remodelling would be consistent with the most recent models of 
macromolecular recognition™, and may further extend these models, 
as the remodelling step here involves significant translation and pre- 
sumably rotation of flexibly linked domains with respect to each 
other. Hence, conformational selection is an important mechanism of 
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ubiquitin chain recognition. This highlights the importance of under- 
standing the conformational space for the eight different ubiquitin 
chain types, as this holds the key to linkage-dependent regulation 
within the ubiquitin system. 

Ubiquitin chain recognition further depends on the dynamic inter- 
conversion of chain conformations. Interfering with conformational 
dynamics by mutating ubiquitin can directly affect chain hydrolysis by 
DUBs, providing the first evidence that conformational rearrange- 
ments in the chains may govern the speed at which the chains are 
hydrolysed. An alternative mechanism to interfere with the conforma- 
tional dynamics is chain length. Lys 48-tetraubiquitin forms a compact 
structure” in which all ubiquitin moieties interact with each other 
through their hydrophobic patches and secondary interaction sites (Sup- 
plementary Fig. 11). This probably alters chain dynamics (‘breathing’ 
of the chains), which we have shown here to be essential for recog- 
nition by DUBs. Consistent with this are recent data showing that 
some DUBs remove mono- and diubiquitin but not tetraubiquitin 
or longer polymers from substrates”*. It is tempting to speculate that 
the reported requirement for Lys 48-linked tetraubiquitin to trigger 
proteasomal degradation”’ may have originated partly from improved 
stability of this length/linkage combination towards DUB action. 
Therefore, factors affecting ubiquitin chain conformation and dyna- 
mics (linkage, length, binding partners) may be key regulators of the 
ubiquitin system. 


METHODS SUMMARY 


Diubiquitin molecules were assembled with Alexa488 and Alexa647 fluorophores 
as detailed in Supplementary Fig. 1. Single-molecule measurements were performed 
at 20 °C in PBST buffer (150 mM NaCl, 18 mM Na;HPO,, 7 mM NaH>PO,, 0.01% 
(v/v) Tween20, pH 7.4). Ubiquitin-interacting proteins were incubated with labelled 
chains for 5min. The single-molecule instrument was described previously”. 
Recording times depended on the amount of FRET-labelled diubiquitin and the 
signal-to-noise ratio, and were typically between 30 min and 3 h. Both TCCD and 
FRET data were collected for all measurements, and were analysed using a coin- 
cidence criterion. Coincident events in both channels”’ are only selected when the 
counts in each channel are above its threshold count value, which was selected 
automatically as described previously”. For experiments in which TCCD data 
were compared with FRET data, a common threshold was used for the donor 
channels (determined by maximizing the association quotient in the TCCD 
experiment), whereas the acceptor channel thresholds were independently deter- 
mined for FRET and TCCD. This ensured that both the FRET and TCCD mea- 
surements of each sample resulted in approximately equal burst rates in the donor 
channel. Bulk FRET measurements, dye lifetime measurements and periodic 
acceptor excitation (PAX) experiments were performed as described in Sup- 
plementary Methods. 
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Activated GTPase movement on an RNA scaffold 
drives co-translational protein targeting 


Kuang Shen’, Sinan Arslan”, David Akopian', Taekjip Ha? & Shu-ou Shan! 


Approximately one-third of the proteome is initially destined for 
the eukaryotic endoplasmic reticulum or the bacterial plasma 
membrane’. The proper localization of these proteins is mediated 
by a universally conserved protein-targeting machinery, the signal 
recognition particle (SRP), which recognizes ribosomes carrying 
signal sequences” ~* and, through interactions with the SRP recep- 
tor®®, delivers them to the protein-translocation machinery on the 
target membrane’. The SRP is an ancient ribonucleoprotein par- 
ticle containing an essential, elongated SRP RNA for which precise 
functions have remained elusive. Here we used single-molecule 
fluorescence microscopy to show that the Escherichia coli SRP- 
SRP receptor GTPase complex, after initial assembly at the tetra- 
loop end of SRP RNA, travels over 100 A to the distal end of this 
RNA, where rapid GTP hydrolysis occurs. This movement is nega- 
tively regulated by the translating ribosome and, at a later stage, 
positively regulated by the SecYEG translocon, providing an 
attractive mechanism for ensuring the productive exchange of 
the targeting and translocation machineries at the ribosome exit 
site with high spatial and temporal accuracy. Our results show that 
large RNAs can act as molecular scaffolds that enable the easy 
exchange of distinct factors and precise timing of molecular events 
in a complex cellular process; this concept may be extended to 
similar phenomena in other ribonucleoprotein complexes. 

Co-translational protein targeting faces fundamental challenges in 
both spatial and temporal coordination. Spatially, both the SRP?“ and 
SecYEG (known as Sec61 in eukaryotes) translocon’ contact the L23 
ribosomal protein and the signal sequence, raising puzzling questions 
about how the translating ribosome is transferred from the targeting to 
the translocation machinery. Temporally, GTP hydrolysis by the SRP- 
SRP receptor complex, which drives its irreversible disassembly’, must 
be accurately timed during cargo delivery and unloading to avoid 
abortive reactions’. Such accurate spatial and temporal coordina- 
tion is required in all protein-targeting pathways, but its underlying 
molecular mechanism is not understood. Here, single-molecule 
experiments reveal large-scale rearrangements in the SRP, providing 
a unifying molecular mechanism to explain how such coordination is 
achieved during co-translational protein targeting. 

The bacterial SRP is comprised of the homologue of an SRP54 
protein subunit, Ffh, and a 114-nucleotide SRP RNA. Ffh contains 
two domains connected by a flexible linker: a methionine-rich 
M domain, which recognizes the signal sequence’® and binds the SRP 
RNA", and a GTPase NG domain that interacts with a homologous 
NG domain in the SRP receptor, FtsY°* (Fig. la). The SRP RNA is a 
universally conserved and essential SRP component, but its precise 
roles are not completely understood. Most previous work’?"* focused 
on the GGAA tetraloop that caps one end of this RNA (Supplementary 
Fig. 1a), which accelerates the initial SRP-FtsY assembly by electro- 
statically interacting with FtsY'’. These findings, however, do not 
explain why the SRP RNA has a conserved elongated structure’®. 
Valuable clues come from a recent crystal structure that found the 
Ffh-FtsY GTPase complex at another docking site near the distal 


end of this RNA, where mutations disrupt GTPase activation’” 
(Fig. 1a, distal state). This posited an attractive hypothesis in which 
the Ffh-FtsY GTPase complex, after initial assembly near the tetra- 
loop’*""*, can re-localize to the distal site of the SRP RNA ~100A 
away”’. Nevertheless, no functional evidence for the re-localization is 
available, nor are the importance, timing, mechanism and regulation of 
such a large-scale movement understood. 

To address these questions, we used single-molecule fluorescence 
resonance energy transfer (smFRET) and total internal reflection 
fluorescence (TIRF) microscopy to detect conformational dynamics 
of individual SRPs"*”. Migration of the SRP-FtsY GTPase complex on 
the SRP RNA was tracked using FRET between a donor (Cy3) attached 
to the FtsY NG domain and an acceptor (Quasar 670) labelled near 
the RNA distal end (Fig. 1a). Stable SRP-FtsY complexes, formed with 
the non-hydrolysable GTP analogue 5’-guanylyl-imidodiphosphate 
(GMPPNP), displayed rapid transitions among multiple FRET states 
(Fig. 1b, c). A low FRET state (~0.1; L) was assigned to the proximal 
state in which the GTPase complex resides near the SRP RNA tetra- 
loop’’. A high FRET state (~0.8; H) was attained ~20% of the time 
and assigned to the distal state in which the GTPase complex stably 
docks at the distal site, as verified below. Cy3 attached to the Ffh NG 
domain showed similar transitions but with a lower FRET value in the 
H state (Supplementary Fig. 2a, b), consistent with Ffh being further 
from the distal site than FtsY’’. These results directly demonstrate 
dynamic movements of the SRP-FtsY GTPase complex on the SRP 
RNA that span over 100 A. 

We used hidden Markov modelling (HMM)-based statistical ana- 
lyses to determine the most likely sequence of FRET transitions”. This 
revealed an ensemble of additional states with intermediate FRET 
values (0.3-0.6; M1 and M2) and extremely short lifetimes (Fig. 1b-d 
and Supplementary Figs 2b-g and 3a-c), representing alternative 
binding modes of the GTPase complex on the SRP RNA. The transition 
information was pooled into a transition density plot that describes the 
number of distinct FRET states, their FRET values, and their transition 
frequencies (Fig. le and Supplementary Fig. 2h). In addition, the kine- 
tics of FRET transitions were obtained from dwell-time analyses 
(Fig. 1f, g and Supplementary Figs 2i-o and 3d—-h). Whereas molecules 
leaving the L state rapidly transitioned to all the other states, the H state 
had a longer lifetime than M1 and M2 and was hence more populated 
(Fig. 1d), indicating more stable docking of the GTPase complex in this 
state. Fifty-eight per cent of transitions to the H state occurred directly 
from L, whereas molecules in the intermediate FRET states transitioned 
primarily back to L (Fig. le and Supplementary Fig. 2h). Thus, correct 
docking at the RNA distal site requires extensive searching that involves 
frequent trial and error. 

To test whether the H state is responsible for GTPase activation, we 
isolated mutant RNAs that specifically perturb the distal docking site. 
The 82-nucleotide RNA lacking this site'’ reduced GTPase activation 
sixfold, whereas a ‘super-active’ mutant, 99A, enhanced GTP hydro- 
lysis 2.5-fold (Fig. 2a, green bars and Supplementary Figs 1b and 4a). 
The GTPase activity of these mutants quantitatively correlated with 
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Figure 1 | smFRET-TIRF microscopy reveals dynamic movements of the 
SRP-FtsY complex on the SRP RNA. a, smFRET setup for the SRP-FtsY 
complex. FtsY Cys 345 is labelled with Cy3. The 5'-end of the DNA splint 

(2 nucleotides from the 3'-end of SRP RNA) is labelled with Quasar 670. 

b, Fluorescent signals (top) and FRET trajectory (bottom) of the SRP-FtsY 
complex in GMPPNP. HMM of the FRET trajectory is depicted in navy. The 
arrow denotes the bleaching of Cy3, after which Quasar 670 was excited using a 


their efficiency of reaching the H state (Fig. 2a and Supplementary 
Figs 5 and 6), strongly suggesting that activated GTP hydrolysis occurs 
at the RNA distal site. 

To test the importance of the RNA distal site in protein targe- 
ting, we measured the ability of SRP and FtsY to deliver a model 
substrate, preprolactin (pPL), to endoplasmic reticulum microsomes”’. 
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Figure 2 | The distal site of SRP RNA is crucial for GTPase activation and 
protein targeting. a, Correlation between GTPase rate constants in the SRP- 

FtsY complex (green) and the frequency of reaching the high FRET state (purple) 
for wild-type, 82-nucleotide (nt) and 99A SRP RNA. Data represent mean + s.d. 
(n = 5).b, ¢, Co-translational targeting and translocation of pPL (b) and its signal 
sequence variants (c) mediated by the wild-type and mutant SRPs. Reactions in 
the absence of SRP RNA are in black. Data represent mean + s.d. (n = 3). 
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635-nm laser to confirm the presence of the complex. A.U., arbitrary units. 

c, Magnification of the grey box in b to depict the four FRET states resolved by 
HMM. d, smFRET histogram depicting the distribution of molecules in 
different states. In M state, the M1 and M2 states are binned together. 

e, Transition density plot for the GTPase movements. f, g, Analysis of the 
transition kinetics between L and H states (f). Exponential fits of the data gave 
the transition-rate constants (k) given in g. 


Translocation of pPL results in cleavage of its signal sequence, allowing 
the targeting and translocation to be efficiency quantified (Supplemen- 
tary Fig. 4b). Furthermore, the specificity of targeting was tested using 
pPL variants in which the signal sequence is systematically varied’ 
(Supplementary Fig. 4d). Mutant 82-nucleotide RNA substantially 
reduced the targeting of correct substrates (wild type, 8L-pPL and 
7L-pPL,; Fig. 2b, cand Supplementary Fig. 4c, e). By contrast, the super- 
active 99A RNA targeted these substrates more efficiently than wild- 
type SRP, without compromising the discrimination against incorrect 
substrates (Fig. 2b, cand Supplementary Fig. 4c, e). Thus the SRP RNA 
distal site, although reportedly not essential for cell survival'’, does 
enhance efficient and accurate co-translational protein targeting. 

SRP and FtsY undergo an unusual GTPase cycle, driven by multiple 
conformational rearrangements in their heterodimer that culminate in 
GTPase activation (Fig 3a—d, cartoons)”***. We questioned how these 
rearrangements within the GTPase complex drive its global movements 
on the SRP RNA, using conditions that block the GTPase cycle at distinct 
stages**”*, SRP by itself exhibited no movements on the RNA (Fig. 3a and 
Supplementary Fig. 7a). Recruitment of FtsY begins with a transient 
early intermediate, which lacks close contacts between the G domains 
and hence can be isolated by leaving out GTP analogues’***. No GTPase 
movement was observed at this stage either (Fig. 3b and Supplementary 
Fig. 7b). Subsequently, GITP-dependent rearrangements give a stable 
closed complex, which lacks optimal positioning of the catalytic loops 
and can be isolated by a mutation, FtsY(A335W), in the catalytic loop 
(Fig. 3c, d)?**?. Although GTPase movements were observed in the 
closed complex, most of them only reached M1 and M2 but did not 
extensively populate the H state (Fig. 3c—-e and Supplementary Fig. 7c, d). 
Thus, GTP-induced rearrangements within the NG domain complex 
drive its global movements on the SRP RNA. Moreover, stable GTPase 
docking at the RNA distal site requires optimal positioning of the cata- 
lytic loops, explaining why mutants that block GTPase activation, such 
as FtsY(A335W), severely impair protein targeting”. 

If the GTPase complex only transiently reaches the SRP RNA distal 
site where GTPase activation occurs, previous ensemble measure- 
ments* would have considerably underestimated the hydrolysis rate. 
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Figure 3 | Conformational rearrangements within the SRP-FtsY GTPase 
complex drive its movement to the RNA distal site. a-d, smFRET histograms 
of free SRP in the open state (a) and of the SRP-FtsY complex in the early 
(b), closed (c) and activated (d) states. Conditions for isolating each 
conformational state are described in the text and Methods. e, Summary of the 


We therefore performed real-time GTPase assays using the smFRET 
setup. If GTP hydrolysis at the distal site, which drives irreversible 
SRP-FtsY dissociation, occurred faster than their return to the pro- 
ximal state, we would observe high FRET ‘bursts’ with GTP instead of 
the reversible transitions with GMPPNP. This was indeed observed 
(Fig. 3f). The duration between these bursts has a rate constant 
(0.59 '; Supplementary Fig. 8b) expected for rearrangement to the 
activated complex (~1 s ')” and is similar to the ensemble GTPase 
rate (0.7s ')’, strongly suggesting that the latter is rate limited by 
GTPase movement to the RNA distal site. The duration of the high 
FRET bursts includes GTP hydrolysis and subsequent SRP-FtsY 
disassembly and exhibits a rate constant of 7.1s ' (Supplementary 
Fig. 8a), providing a lower limit for the actual hydrolysis rate and is 
at least tenfold faster than ensemble measurements’. 

These results also show that GTP drives almost irreversible move- 
ment of the GTPases to the RNA distal site, necessitating accurate 
control of the timing of this movement. Indeed, ribosome-nascent 
chain complexes (RNC or cargo) delay GTPase activation in the 
SRP-FtsY complex®”? (Fig. 4a, wild type). This effect, termed ‘pausing’, 
prevents premature GTP hydrolysis and is essential for ensuring the 
efficiency and specificity of the SRP pathway’. We wanted to know 
whether the RNC negatively regulates the GTPase movement to the 
SRP RNA distal site. RNCpsq, which carries an obligate SRP substrate 
FtsQ, completely abolished the GTPase movements on the RNA 
(Fig. 4b and Supplementary Fig. 9a—e). This is specific to the correct 
cargo, as RNC carrying luciferase, which contains no signal sequence, 
exerted no effects (Fig. 4c and Supplementary Fig. 9f). Further, GTP 
hydrolysis in the presence of RNC is no longer affected by mutations in 
the RNA distal end (Fig. 4a and Supplementary Fig. 10a, b), but is still 
reduced by a mutation in FtsY active site** (Supplementary Fig. 10d, e). 
These results demonstrate that correct cargos stabilize the GTPase 
complex in the proximal state and prevent its re-localization to the 
RNA distal site, thus exerting the pausing effect. 

On the target membrane, RNC must be transferred from the tar- 
geting to the translocation machinery. Both the mechanism of this 
transfer and its timing have remained long-standing challenges. To 
test whether the translocon helps to regulate these events, we added the 
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SecYEG complex to the RNCpsq-SRP-FtsY complex (Supplementary 
Fig. lla). SecYEG restored the high FRET state (Fig. 4d, e). It also 
reversed the cargo-induced pausing and restored efficient GIP hydro- 
lysis (Fig. 4a and D.A. and S.-o.S., manuscript in preparation). Neither 
effect was observed with the detergent n-dodecyl-f-p-maltoside (DDM) 
alone (Supplementary Fig. 11b) nor with mutant 82-nucleotide RNA 
(Fig. 4f and Supplementary Fig. 10c) or FtsY(A335W) (Supplementary 
Fig. 11c). Thus, SecYEG drives productive docking of the GTPase 
complex at the RNA distal site and thus re-activates GTP hydrolysis. 

Next we wanted to know how SecYEG restores GTPase movements. 
Although SecYEG could simply remove the RNC from the SRP-FtsY 
complex, the following strongly suggests that this is not the case. 
Compared to the SRP-FtsY complex alone, GTPase movements in 
the presence of RNCyisq and SecYEG displayed a distinct pattern, 
characterized by fewer transitions to intermediate FRET states, more 
frequent docking (Fig. 4g, h) and longer dwell times in the H state 
(Fig. 4d, i). These SecYEG-induced changes were not observed without 
RNC (Supplementary Fig. 11d, e). To test directly whether RNC remains 
on the targeting complex, we labelled the RNC with Alexa 647, which 
was found to co-localize with labelled SRP (Supplementary Fig. 12). 
These co-localized spots remain after incubation with SecYEG (Sup- 
plementary Fig. 12c, d), indicating that RNC was not displaced by 
SecYEG. These data imply that SecYEG forms a quaternary complex 
with RNC, SRP and FtsY, which could represent a transient inter- 
mediate in the targeting and translocation reaction. These results also 
suggest that SecYEG drives the GTPase movement through two 
mechanisms: (1) displacing the GTPase complex from the proximal 
site, as indicated by the reappearance of the H state even with RNC 
present (compare Fig. 4d with Supplementary Fig. 9b); and (2) 
prolonging productive docking at the RNA distal site (Fig. 4i). 
Finally, non-productive movements to intermediate FRET states are 
minimized with RNC and SecYEG present (Fig. 4g, h). Considering 
the size of SRP RNA relative to the ribosome, the RNC possibly masks 
non-productive GTPase docking sites on the SRP RNA, which could 
also explain the conserved length of this RNA. 

In summary, we demonstrate that the SRP RNA provides a molecu- 
lar scaffold that mediates large-scale movements of the SRP-FtsY 
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Figure 4 | RNC and SecYEG regulate GTPase movements on the SRP RNA. 
a, Effect of RNCps5q (grey) and SecYEG (purple) on the GTPase activity of the 
SRP-FtsY complex assembled with the wild-type, 82-nucleotide and 99A SRP 
RNA. Data represent mean + s.d. (n = 3). b, c, smFRET histograms of the 
SRP-FtsY complex bound to RNCpisQ (b) or RNChaciferase (€). d, Fluorescent 
signals (top) and FRET trajectory (bottom) of the RNCisqg-SRP-FtsY complex 
in the presence of SecYEG. e, f, smFRET histograms of the RNCp,,q-SRP-FtsY 
complex in the presence of SecYEG, with the wild-type (e) or 82-nucleotide 


complex, which is tightly regulated by the GTPase cycle of SRP 
and FtsY, the translating ribosome and the SecYEG translocon. 
Together with previous studies, we propose a molecular model for 
co-translational protein targeting (Fig. 4j). Upon cargo recognition 
(step 1), the SRP RNA tetraloop is optimally positioned adjacent to 
the Ffh NG domain, allowing efficient recruitment of FtsY near the 
ribosome exit site'*'> (step 2). GTP-induced rearrangements in the 
SRP-FtsY NG-domain complex prime*”* but are insufficient to 
release this complex for docking at the distal site owing to the pausing 
effect of RNC. SecYEG is required to drive GTPase re-localization to 
the SRP RNA distal site (step 3). This vacates the ribosome exit site and 
allows SecYEG to initiate contacts with L23, thus enabling the coordi- 
nated transfer of RNC from the targeting to translocation machinery 
(step 4). Concomitantly, GTPase docking at the RNA distal site trig- 
gers rapid GTP hydrolysis, driving the disassembly and recycling of 
SRP and FtsY (step 4 and 5). This provides an attractive mechanism to 
allow the concerted exchange of SRP and SecYEG at the ribosome and 
the precise timing of GTP hydrolysis, thus minimizing abortive reac- 
tions due to premature SRP—-FtsY disassembly or non-productive loss 
of cargo. 

Nucleic-acid-mediated protein movement is a veidespsead phenom- 
enon and has been observed with the spliceosome”’, helicases*”** and 
type I restriction endonucleases””°. Our results enrich these findings 
and further suggest that large RNA molecules can provide useful mole- 
cular scaffolds to coordinate multiple protein interactions and large- 
scale protein rearrangements, thus enabling productive exchange of 
different factors and precise timing of molecular events in a cellular 
pathway. This may provide general principles for understanding sim- 
ilar phenomena in other ribonucleoprotein particles. 
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(f) SRP RNA. g, Transition density plot of G[Pase movements in the presence 
of RNCysq and SecYEG. h, Summary of the percentage of molecules that 
exhibit the specified transitions. In the presence of RNCpisq and SecYEG, 
transitions to intermediate FRET states are almost completely reduced. 

i, Scatter plot of the transition dwell times of individual molecules in the 
absence (grey circles) and presence of RNCpisq and SecYEG (red crosses). 

j, Model for the role of the SRP-RNA-mediated GTPase re-localization in co- 
translational protein targeting. 


METHODS SUMMARY 

Materials. Protein, SRP RNA, messenger RNA, RNC and SecYEG were prepared 
as described in Methods. Ffh and FtsY were labelled with Cy3 at a unique cysteine 
residue, Cys153 and Cys345, respectively, using maleimide chemistry as 
described in ref. 12. Labelled DNA splint was purchased from Biosearch 
Technologies. Translation extracts and salt-washed, trypsin-digested microsomal 
membranes were prepared as described in ref. 21. 

Single-molecule instrumentation and imaging. Olympus IX-81 microscope 
was modified as described previously in ref. 19. SRP was immobilized by extending 
the 5’-end of SRP RNA, which hybridizes to a complementary oligonucleotide 
(DNA splint) 3’-labelled with biotin and thus coupled to PEGylated, streptavi- 
din-coated coverslip surface through biotin-streptavidin interaction. Fluorescent 
signals were split by DualView (Photometrics) and recorded with an EMCCD 
camera (Andor). The data were extracted and analysed by scripts written in IDL 
and MATLAB”. HMM was carried out using the HaMMy program”, and the 
transition heat-map was plotted with TDP software”. Details of the instru- 
ment setup, sample immobilization, data collection and analyses are described 
in Methods. 

Biochemical assays. Assays for GTPase activity and for co-translational protein 
targeting and translocation were performed and analysed as described previously 
in refs 8 and 21, and are summarized in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Plasmids. Plasmids for in vivo expression of Ffh, full-length FtsY and SRP RNA, 
and for in vitro transcription of FtsQ, luciferase and pPL and its signal sequence 
variants have been described previously*”*'. The pEK20 construct for SecYEG 
expression was a gift from A. Driessen’’. Plasmids for mutant SRP RNAs and 
mutant proteins were constructed using the QuikChange mutagenesis protocol 
(Stratagene) following manufacturer’s instructions. The plasmid for in vitro tran- 
scription of hammerhead ribozyme-SRP RNA-hepatitis delta virus (HDV) ribo- 
zyme was a gift from A. Ferre-D’Amare*. The hammerhead coding sequence was 
removed and the 5’-end of SRP RNA was extended using the QuikChange muta- 
genesis protocol to make in vitro transcription constructs for single-molecule RNA 
(smRNA) constructs. 

Protein preparations. Wild-type and single-cysteine mutants of Ffh and FtsY 
were expressed and purified as described previously’. In brief, Ffh expression was 
induced in logarithmically growing BL21(DE3)pLysE cells with 1 mM isopropyl- 
B-p-thiogalactoside (IPTG). The soluble fraction from lysed cells was purified by 
cation-exchange chromatography on SP Sepharose Fast Flow resin (GE Health- 
care) using a gradient of 0.25-1 M NaCl, and was further purified by gel-filtration 
chromatography on the Superosel2 column (Amersham Biosciences). His¢- 
tagged full-length FtsY was expressed in BL21(DE3)pLysS cells by induction 
with 0.5 mM IPTG in logarithmically growing cells. The soluble fraction from 
lysed cells was purified by anion-exchange chromatography using Q Sepharose 
Fast Flow resin (GE Healthcare) with a gradient of 150-500 mM NaCl, 
followed by affinity purification using nickel-nitrilotriacetic acid (Ni-NTA) resin 
(Qiagen). For GTPase assays, FtsY was further purified by anion-exchange 
chromatography on the MonoQ column (Amersham Biosciences) using a 
gradient of 150-350mM NaCl. All proteins were exchanged into SRP buffer 
(50 mM potassium-HEPES, pH 7.5, 150 mM potassium acetate, 2 mM magnesium 
acetate, 2mM dithiothreitol, 0.01% Nikkol) before use. 

Detergent-solubilized SecYEG was expressed in BL21(DE3) cells with 

0.5 mM IPTG and was purified following published procedures***”*. Cells were 
lysed by sonication and the membranes were collected by ultracentrifugation. 
SecYEG was extracted and purified by cation-exchange chromatography on the 
SP Sepharose Fast Flow resin (GE Healthcare) followed by affinity purification 
with Ni-NTA (Qiagen). DDM (Affimetrix) was used for purification of solubilized 
SecYEG, which has been shown to be fully functional in binding RNC (D.A. &S.S., 
manuscript in preparation)**, in mediating nascent peptide translocation*’, and 
stimulating SecA ATPase activity’’. 
Fluorescence labelling. Single-cysteine mutants of Ffh and FtsY were labelled 
with Cy3-maleimide (GE Healthcare) as described in ref. 12. Protein concentra- 
tion during labelling was 50-100 1M, and the dye was in tenfold molar excess. 
Labelling reaction was carried out in buffer A (50 mM potassium-HEPES, pH 7.0, 
300 mM NaCl, 2mM EDTA, 10% glycerol) with gentle shaking at room tempera- 
ture (22 °C) for 2 h. Unconjugated dyes were removed by gel-filtration chromato- 
graphy using Sephadex G-25 resin (Sigma). Mass spectrometry confirmed >95% 
labelling efficiencies. Fluorescence labelling and modifications of the SRP RNA for 
surface immobilization (Fig. la and Supplementary Fig. 1b) did not affect the 
activity of SRP and FtsY (Supplementary Fig. 1d). 

Fluorescent DNA probes for hybridization with the mRNA on RNC were pre- 
pared by incubating NH-modified DNA oligonucleotides (Integrated DNA 
Technologies) with a tenfold excess of Alexa Fluor 647 carboxylic acid succinimi- 
dyl ester (Invitrogen) for 1 h at 37 °C. Excess dyes were removed by HPLC. 
RNA preparation. Wild-type SRP RNA was expressed in vivo and purified as 
described in ref. 8. SRP RNAs for smFRET experiments (smRNA; Supplementary 
Fig. 1b) were prepared by in vitro transcription using T7 polymerase according to 
the Megascript protocol (Ambion). The 3’-end of the SRP RNA coding sequence 
was fused to that of an HDV ribozyme (sequence: 5'-GGGCGGCATGGTCCC 
AGCCTCCTCGCTGGCGCCGCCTGGGCAACATTCCGAGGGGACCGTCCC 
CTCGGTAATGGCGAATGGGACC-3’). Self cleavage of the HDV ribozyme 
occurred during in vitro transcription to generate a homogeneous 3’-end of 
the SRP RNA. Purified smRNA was annealed to a complementary DNA splint 
by the following procedures: (1) heat the TE buffer (10mM Tris-HCl, pH7.0, 
2mM EDTA) containing 10 4M DNA and 20M smRNA for 5 min at 75 °C; 
(2) gradually cool to 50 °C over a period of 30 min; (3) add 12 mM MgCl, to the 
mixture; and (4) gradually cool to room temperature over a period of 30 min. The 
annealed DNA-smRNA hybrids were stored at —80 °C. 

mRNAs for in vitro translation were generated by in vitro transcription using T7 
(for RNC preparation) or SP6 (for targeting assays) polymerase following the 
Megascript protocol (Ambion). 

Preparation of RNCs. Synchronized RNCs with defined nascent chain length and 
sequence were prepared as described previously in ref. 31. In brief, mRNAs encod- 
ing a Strep; tag at the amino terminus, the first 74 amino acids of FtsQ or lucifer- 
ase, and a SecM translation-stall sequence at the carboxy terminus were translated 


by S100 extract as described in ref. 31. RNC from the translation mixture was 
purified by affinity chromatography using the Strep-Tactin resin (IBA), collected 
by ultracentrifugation, and re-dissolved in SRP buffer and stored at —80 °C. RNCs 
used for GTPase assay were further purified by ultracentrifugation and fractiona- 
tion on a 10-50% sucrose gradient as described in ref. 31. 

RNCp;sq was fluorescently labelled by incubation with fluorescent DNA probes 
complementary to the mRNA for 3h at room temperature. Labelled RNC was 
isolated by ultracentrifugation and re-dissolved in SRP buffer. 

Single-molecule instrument. Objective-type TIRF microscope was home-built 
based on an Olympus IX-81 model as described in ref. 19. Green (532 nm) and red 
(635 nm) lasers were focused in a X100 oil-immersed objective. Scattering light 
was removed by 560-nm and 660-nm long-pass filters (Chroma) for the green and 
red lasers, respectively. Cy3 and Quasar 670 signals were split by a dichroic mirror 
and simultaneously focused onto the iXon 897 camera (Andor) through DV2 
Dualview (Photometrics). Data were recorded at 30-ms time resolution. 
PEGylated slides and coverslips. PEGylated slides and coverslips were prepared 
based on an existing protocol’. In brief, quartz slides and coverslips were treated 
sequentially with 10% alconox, acetone and 10 M KOH. The surfaces were then 
burnt with a propane torch to remove autofluorescence. Amino-silation reactions 
were carried out in methanol with 5% (v/v) acetic acid and 1% (v/v) aminopro- 
pylsilane. PEGylation reactions were carried out in 100 mM NaHCO; buffer con- 
taining 20% (w/v) polyethylene glycol (PEG) and 0.6% (w/v) Biotin-PEG. 
PEGylated slides and coverslips were stored in vacuum at —20 °C and assembled 
into flow chambers before use. 

Single-molecule assay. To remove aggregates, all protein samples were ultra- 
centrifuged at 100,000 r.p.m. (Optima TLX, Beckman Coulter) for 1h before 
use. PEGylated slides and coverslips were assembled to form a flow chamber. 
Neutravidin (0.2mgmg ') was applied to the chamber and incubated for 
10 min before flowing in molecules of interest. 

SRP complexes were assembled in SRP buffer under the following conditions. 
SRP-FtsY complex with labelled Ffh: 1 1M DNA-smRNA hybrid, 2 1M Ffh-Cy3, 
5 uM FtsY, 100 p.M GMPPNP. SRP-FtsY complex with labelled FtsY: 1 14M DNA- 
smRNA hybrid, 2 1M Ffh, 3 uM FtsY-Cy3, 100 4M GMPPNP. RNC-SRP-FtsY 
complexes: 200 nM DNA-smRNA hybrid, 400 nM Ffh-Cy3, 500 nM RNCgsq or 
1uM RNChciferase LUM FtsY, 100M GMPPNP. SecYEG solubilized in 
0.02% DDM was added to RNCg;sqg-SRP-FtsY complex at 10 1M. The samples 
were then diluted to 50 pM in imaging buffer (SRP buffer supplemented with 
0.4% glucose and 1% Gloxy in Trolox), flowed onto the sample chamber and 
incubated for 5 min before imaging. Movies were recorded at 30-ms intervals 
for up to 3 min until most fluorescent molecules were photobleached. A red laser 
was applied at the end of the movie to confirm the presence of immobilized SRP. 
Data analysis. Single-molecule data were processed by scripts written in IDL and 
MATLAB. In brief, fluorescent peaks in the images were identified and traced 
throughout the trajectory. Traces that showed a single-donor bleaching event were 
used for data analysis. HMM was calculated using the HaMMy program”. 
Transition density map was generated by the TDP program” using the output 
from HaMMy. FRET histograms were generated using home-written script in 
MATLAB”. Transition kinetics between different states was obtained by expo- 
nential fits to dwell-time histograms. Two-dimensional scatter plots of the average 
dwell time of individual molecules during transitions were generated using the 
home-written script in MATLAB. 

GTPase assay. GTPase rate constants were determined using a well-established 
GTPase assay*. In general, reactions contained 100nM Ffh, 200nM SRP RNA, 
100M GTP (doped with y-*’P-GTP), varying concentrations of FtsY, and 
250nM RNCp;sq and 10 [1M SecYEG where applicable. Reactions were quenched 
with 0.75 M KH,PO,, pH 3.3, at different time points, separated by thin layer 
chromatography and quantified by autoradiography. 

Translocation assay. Assays for co-translational protein targeting and transloca- 
tion were carried out as described in refs 21 and 38. Reactions contained 10 ul 
in vitro translation mixtures synthesizing *°S-methionine-labelled pPL or pPL 
signal-sequence variants, to which 200nM Ffh, 333nM wild-type or mutant 
SRP RNA, 300nM FtsY and 0.5 equiv. per pl of salt-washed, trypsin-digested 
microsomal membrane was added to a total volume of 15 pl. Reactions were 
analysed by SDS-PAGE followed by autoradiography. 
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Centralspindlin links the mitotic spindle to the 
plasma membrane during cytokinesis 
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At the end of cell division, cytokinesis splits the cytoplasm of nas- 
cent daughter cells and partitions segregated sister genomes’. To 
coordinate cell division with chromosome segregation, the mitotic 
spindle controls cytokinetic events at the cell envelope. The spindle 
midzone stimulates the actomyosin-driven contraction of the cleav- 
age furrow, which proceeds until the formation of a microtubule- 
rich intercellular bridge with the midbody at its centre. The midbody 
directs the final membrane abscission reaction’” and has been pro- 
posed to attach the cleavage furrow to the intercellular bridge’. How 
the mitotic spindle is connected to the plasma membrane during 
cytokinesis is not understood. Here we identify a plasma membrane 
tethering activity in the centralspindlin protein complex, a conserved 
component of the spindle midzone and midbody*. We demonstrate 
that the C1 domain of the centralspindlin subunit MgcRacGAP asso- 
ciates with the plasma membrane by interacting with polyanionic 
phosphoinositide lipids. Using X-ray crystallography we determine 
the structure of this atypical C1 domain. Mutations in the hydro- 
phobic cap and in basic residues of the C1 domain of MgcRacGAP 
prevent association of the protein with the plasma membrane, and 
abrogate cytokinesis in human and chicken cells. Artificial membrane 
tethering of centralspindlin restores cell division in the absence of the 
C1 domain of MgcRacGAP. Although C1 domain function is dispen- 
sable for the formation of the midzone and midbody, it promotes 
contractility and is required for the attachment of the plasma mem- 
brane to the midbody, a long-postulated function of this organelle’. 
Our analysis suggests that centralspindlin links the mitotic spindle to 
the plasma membrane to secure the final cut during cytokinesis in 
animal cells. 

Despite the importance of the mitotic apparatus for cytokinesis, direct 
interactions between core midzone or midbody components and the 
plasma membrane have not been identified so far. A crucial regulator of 
cytokinesis in animal cells is a conserved heterotetrameric protein com- 
plex called centralspindlin*. Centralspindlin is composed of two mole- 
cules of the kinesin protein MKLP1 and two molecules of the Rho family 
GTPase-activating protein (GAP) MgcRacGAP°® (also known as 
RACGAP!1 or CYK4; Supplementary Fig. 1a). Centralspindlin organizes 
antiparallel arrays of microtubules at the spindle midzone and midbody, 
controls the activity of the small GTPases RhoA and Rac during furrow 
ingression’"°, and recruits cytokinetic effector proteins to promote cor- 
tical contractility and abscission''’. 

A search for potential membrane-interaction moieties among mid- 
zone and midbody components revealed an uncharacterized but evo- 
lutionarily conserved Cldomain in the centralspindlin subunit 
MgcRacGAP (Fig. 1a and Supplementary Fig. 1b). Cl domains can 
serve as protein- or lipid-binding modules’”"*. Using X-ray crystal- 
lography, we determined the structure of the Cl domain of human 
MgcRacGAP and refined it to a resolution of 2.2 A (Fig. 1b, Sup- 
plementary Fig. 2 and Supplementary Table 1; see accession details 


below). The crystal structure revealed a characteristic C1 domain fold 
with two Zn’* -coordinating centres and a B-sheet-loop configuration 
that may act as a ligand-binding interface'”'* (Fig. 1b). 

To scrutinize the function of MgcRacGAP’s C1 domain, we gener- 
ated monoclonal human HeLa Kyoto cell lines expressing short inter- 
fering RNA (siRNA)-resistant and tagged alleles of MgcRacGAP 
(Fig. la, cand Supplementary Fig. 3). Wild-type MgcRacGAP resistant 
to siRNA (MgcRacGAPr) effectively suppressed the cytokinesis failure 
caused by depletion of the endogenous counterpart (Fig. 1c). Elimin- 
ation of the MKLP1-interacting region®*® at the amino terminus of 
MgcRacGAP (MgcRacGAPr-AN) abolished the ability of the trans- 
genic protein to support cell division. Importantly, deletion of the 
Cldomain (MgcRacGAPr-AC1) in MgcRacGAP also completely 
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Figure 1 | The C1 domain of MgcRacGAP is required for cytokinesis. 

a, Transgenic MgcRacGAP system. aa, amino acid; AcGFP, Aequorea 
coerulescens green fluorescent protein; C1, cysteine-rich domain; CC, coiled 
coil; GAP, GTPase-activating protein domain. b, Structure of the C1 domain of 
human MgcRacGAP at 2.2-A resolution. Side chains of mutated structural core 
and surface residues are shown in red and green, respectively. Zn’* - 
coordinating residues are shown in yellow. c, Monoclonal HeLa Kyoto cell lines 
expressing the indicated transgenes were transfected with control or 
MgcRacGAP siRNA duplexes. Multi-nucleation in AcGFP-positive interphase 
cells 48 h after transfection (graph). Error bars, s.d. of three experiments 

(n> 200 cells each). Immunoblot analysis of cell lines is shown in 
Supplementary Fig. 3b. WT, wild-type. d, Localization of MgcRacGAPr—FLAc 
transgenes; endogenous MgcRacGAP depleted in monoclonal MgcRacGAPr- 
FLAc cell lines (1 > 50 cells each). Scale bars, 10 ttm. 
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abrogated cytokinesis (Fig. 1c). Furthermore, single point mutations in 
structural core residues (Phe289Gly and Cys316Gly) and conserved 
surface residues (Lys292Leu, Arg306Leu and Phe309Ala) (Fig. 1b) of 
MgcRacGAP’s Cl domain severely compromised cytokinesis (Fig. 1c). 
Thus, the Cl domain of MgcRacGAP has a critical role during cell 
division in human cells. 

Biochemical and cellular analyses revealed that MgcRacGAP’s 
C1 domain was dispensable for the assembly of the centralspindlin 
complex (Supplementary Fig. 4a), the localization of centralspindlin 
(Fig. 1d and Supplementary Fig. 4b), and the organization of micro- 
tubule arrays at the midzone and midbody (Fig. 1d). It was also 
dispensable for the midzone and midbody localization of known 
cytokinetic regulators (Supplementary Fig. 4c). In marked contrast, 
MgcRacGAPr-AN abolished the formation of the centralspindlin 
complex and severely impaired the organization of the midzone and 
midbody (Fig. 1d and Supplementary Fig. 4a, c). Thus, the assembly of 
the spindle midzone and midbody requires formation of the central- 
spindlin complex but not MgcRacGAP’s C1 domain, suggesting that 
this domain encodes a novel function of centralspindlin that is essen- 
tial for cytokinesis. 

Based on the ability to interact with diacylglycerol (DAG) and 
phorbol ester DAG mimetics, C1 domains are divided into two func- 
tional groups, typical and atypical'’. Acute treatment with the phorbol 
ester 12-O-tetradecanoylphorbol-13-acetate (TPA) caused the rapid 
translocation of the typical C1B domain of human protein kinase Ca 
(PKCa) to the plasma membrane (Fig. 2a). In contrast, the C1 domain 
of MgcRacGAP remained cytosolic (Fig. 2a and Supplementary Video 
1). Consistent with structural considerations (Supplementary Fig. 2c), 
the C1 domain of MgcRacGAP does not respond to phorbol esters and 
belongs to the atypical family. 

Centralspindlin contains two molecules of MgcRacGAP and can 
form higher-order clusters®’’. Therefore, we tested the subcellular 
distribution of tandem MgcRacGAP C1 domains. Fluorescently tagged 
head-to-tail fusion proteins of two or four C1 domains (C1,, and C14) 
localized to the plasma membrane independently of the actomyosin 
cortex (Fig. 2b and Supplementary Fig. 5a; see also Methods). Purified 
glutathione S-transferase (GST)-Cl protein bound to the phos- 
phoinositide lipids phosphatidylinositol-4-phosphate (PtdIns(4)P), 
phosphatidylinositol-4,5-bisphosphate (PtdIns(4,5)P2), phosphatidy- 
linositol-3,4,5-trisphosphate (PtdIns(3,4,5)P3), and to phosphatidic 
acid in vitro (Supplementary Fig. 5b). Ca**- and ionomycin-induced 
depletion of PtdIns(4)P and PtdIns(4,5)P, in vivo” triggered the rapid 
release of the Cl,, protein from the plasma membrane (Fig. 2c). To 
selectively hydrolyse phosphoinositides at the plasma membrane we 
used the hybrid lipid phosphatase pseudojanin (PJ)”°. Pseudojanin 
consists of the polyphosphate-5-phosphatase E (INPP5E), which con- 
verts PtdIns(4,5)P> to PtdIns(4)P, and the Saccharomyces cerevisiae 
Sacl phosphatase (Sac), which dephosphorylates PtdIns(4)P. Rapamycin- 
controlled membrane targeting of pseudojanin, but not of its cataly- 
tically inactive counterpart (PJ-dead), caused the rapid and complete 
release of the Cl,, protein from the cell envelope (Fig. 2d). Individual 
elimination of PtdIns(4)P and PtdIns(4,5)P> elicited by membrane 
recruitment of only active Sac or INPP5E, respectively, caused a partial 
displacement of C1,, from the plasma membrane (Fig. 2d). Together, 
these results suggest that MgcRacGAP’s C1 domain associates with the 
inner leaflet of the plasma membrane by binding to PtdIns(4)P and 
PtdIns(4,5)P,, two polyanionic phosphoinositides that contribute to 
plasma membrane identity”. Consistent with this conclusion, surface 
plasmon resonance (SPR) experiments revealed that PtdIns(4)P and 
PtdIns(4,5)P. stimulated the interaction of the purified C1 domain 
with liposomes in vitro (Fig. 2e and Supplementary Fig. 5c). 

Importantly, introduction of the C1 domain point mutations that 
abrogate cytokinesis (Fig. 1) into the tandem C1), fusion abolished the 
protein’s membrane localization (Fig. 2b). The surface mutation 
Lys292Leu also eliminated the association of the purified C1 domain 
with PtdIns(4)P- or PtdIns(4,5)P2-containing liposomes in SPR 
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Figure 2 | Plasma membrane association of the C1 domain of MgcRacGAP. 
a, HeLa Kyoto cells expressing the typical C1B domain of human PKCa (C1B- 
FLAc) or the C1 domain of MgcRacGAP (C1-FLAc) were recorded before (0 s) 
and 200s after treatment with 1 uM TPA. b, Analysis of HeLa Kyoto cells 
expressing the indicated C1-FLAc variants (n > 30 cells). ¢, HEK293T cells 
expressing C1,,-FLAc were recorded before (—) and 200s after (+) treatment 
with 1 mM CaCl, and 10 uM ionomycin. Quantification of C1,,-FLAc 
membrane localization (n = 15 cells each). Error bars, s.d. d, HEK293T cells co- 
expressing Cl,,-FLAc, Lyn11-FRB-mCherry and one of four monomeric red 
fluorescent protein (mRFP)-FKBP-tagged lipid phosphatases (pseudojanin 
(PJ), Sac, INPP5E or the catalytically inactive counterpart of PJ (PJ-dead))”° 
were recorded before (0s) and 200s after treatment with 10 1M rapamycin. 
Quantification of C1,,-FLAc membrane localization (n = 15 cells each). Error 
bars, s.d. e, Surface plasmon resonance analysis of the binding of purified 
proteins (250 nM) to phosphatidylethanolamine vesicles containing 5% 
PtdIns(4)P or PtdIns(4,5)P>. Binding to phosphatidylethanolamine vesicles 
was used as a reference. f, Model of MgcRacGAP C1 domain-membrane 
interaction. Scale bars, 10 um. 


measurements (Fig. 2e). Computational analysis” of MgcRacGAP’s 
C1 domain structure suggests that the protein interacts with a lipid 
bilayer through a hydrophobic cap that penetrates into the hydrocar- 
bon core (Fig. 2f). This configuration also positions several basic sur- 
face residues within the interface region where they could engage with 
negatively charged lipid heads, such as those of PtdIns(4)P and 
PtdIns(4,5)P>. This interpretation is substantiated by the fact that 
mutations in the hydrophobic cap (Phe309Ala) and in basic residues 
(Lys292Leu and Arg306Leu) abrogate membrane association of the 
C1 domain (Fig. 2b, e). The strict correlation between the failure to 
support cell division and membrane localization in the mutant alleles 
raised the possibility that plasma membrane binding could be the key 
function of MgcRacGAP’s C1 domain during cytokinesis. 

To test this hypothesis, we replaced the atypical C1 domain in 
MgcRacGAP with the typical C1B domain of human PKCa 
(MgcRacGAPr-C1B) (Fig. 3a). This enabled us to chemically control 
the membrane association of the hybrid protein by the addition 
of phorbol esters (Supplementary Fig. 6a, b). After depletion of the 
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Figure 3 | Artificial plasma membrane tethering of MgcRacGAP restores 
cytokinesis in the absence of C1 domain function. a—c, Domain organization 
of hybrid proteins (a, c). Time-lapse series and quantification of cytokinesis 
failure in cells transfected with MgcRacGAP siRNA (b). Dotted arrow, point at 
which DMSO or TPA was added. Error bars, s.d. of two experiments 

(n> 112 cells each). d, Multi-nucleation in AcGFP-positive interphase cells. 
Images depict MgcRacGAP siRNA-transfected cells. None, protein 
MgcRacGAPr-AC1-FLAc does not contain any GEM fusion. Error bars, s.d. of 
three experiments (m > 200 cells each). Scale bar, 10 um. 


endogenous MgcRacGAP protein, cells stably expressing MgcRacGAPr- 
C1B were released from a metaphase arrest and treated with either 
dimethylsulphoxide (DMSO) or the phorbol esters TPA (2.5nM) and 
phorbol-12,13-dibutyrate (30 nM PDBu). Although DMSO-treated cells 
failed to divide, acute addition of phorbol esters restored successful 
cytokinesis in more than one-half of the MgcRacGAPr-C1B-hybrid- 
expressing cells (Fig. 3b). Mutation of a Zn’* -coordinating cysteine in 
the C1B domain (Cys31Gly) abolished the phorbol-ester-dependent 
membrane translocation and rescue activity of the hybrid protein 
(Fig. 3b and Supplementary Fig. 6a, b). Next, we inserted the polybasic 
tail of the human GTPase GEM at the carboxyl terminus of a version of 
MgcRacGAP that lacks the Cl domain (MgcRacGAPr-AC1-GEM) 
(Fig. 3c). Targeting of the tail of GEM to the plasma membrane involves 
PtdIns(4,5)P2 and PtdIns(3,4,5)P3 lipids”. Insertion of GEM’s polybasic 
sequence restored successful cell division in the absence of MgcRacGAP’s 
C1 domain (Fig. 3d and Supplementary Fig. 6c). Mutation of two hydro- 
phobic residues in GEM’s tail to alanine (Phe-Trp to Ala-Ala) abolished 
both the ectopic membrane association (Supplementary Fig. 6c, d) and 
the rescue activity of the hybrid protein (Fig. 3d). Insertion of GEM’s 
polybasic tail had no effect on the penetrance of cell-division failure in 
the MgcRacGAP-AN allele that cannot interact with MKLP1 (Sup- 
plementary Fig. 7). Thus, artificial tethering of MgcRacGAP to the 
plasma membrane specifically bypasses the requirement for C1 domain 
function during cytokinesis. This shows that the atypical C1 domain of 
MgcRacGAP mediates plasma-membrane tethering of centralspindlin to 
control cytokinesis. 

Complete loss of MgcRacGAP abrogated cleavage-furrow formation 
and the accumulation of RhoA and the contractile ring component 
anillin at the equator (Supplementary Fig. 8a-d). Mutations in 
MgcRacGAP’s C1 domain supported full furrow ingression with normal 
kinetics in the majority of cells but caused furrow retraction and cyto- 
kinesis failure within approximately 50 min after anaphase onset (Sup- 
plementary Fig. 8a, d, e). In contrast, cells depleted for the abscission 
regulator CEP55 remained connected for up to 9h (Supplementary 
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Fig. 8e). Also, mutation of the catalytic arginine finger in the GAP 
domain of MgcRacGAP (Arg385Ala) had a far less pronounced effect 
on the execution of cytokinesis than point mutations within the protein’s 
Cl domain (Supplementary Fig. 9a). These observations suggest that 
facilitating GTP hydrolysis or cytokinetic abscission do not represent 
the primary function of centralspindlin’s membrane association. Quan- 
tification of the intensity of RhoA and anillin revealed a minor reduction 
in the width of the contractile zone in MgcRacGAP C1 domain mutant 
alleles, probably accounting for the inability of a fraction of cells to 
complete furrow ingression (Supplementary Fig. 8a—c). Thus, although 
membrane tethering of centralspindlin contributes to the stimulation of 
contractility by the spindle midzone*'°'*»*, it becomes critically 
important for cytokinesis soon after completion of furrowing. 

Live-cell imaging in cells complemented with wild-type MgcRacGAP 
showed that the plasma membrane remained associated with the mid- 
body until abscission (Fig. 4a and Supplementary Video 2) (27 out of 
30 cells). MgcRacGAP proteins carrying the Cl domain mutations 
Lys292Leu or Phe289Gly also accumulated at the midbody after furrow 
ingression. However, the plasma membrane subsequently detached 
from the mutant midbody leading to furrow retraction and cytokinesis 
failure (Fig. 4a and Supplementary Video 2) (19 out of 24 cells). Trans- 
mission-electron-microscopy analysis of telophase cells frequently 
revealed areas of cytoplasmic density between the midbody matrix 
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Figure 4 | Persistent association of the plasma membrane with the midbody 
requires the C1 domain of MgcRacGAP. a, Confocal imaging of cell lines 
stably co-expressing MyrPalm—mCherry and the indicated transgenes. Cells 
were transfected with MgcRacGAP siRNA. Arrowheads indicate detachment of 
the plasma membrane from the midbody. b, Transmission electron 
micrographs (TEMs) of cells stably expressing the indicated transgenes. Cells 
were transfected with MgcRacGAP siRNA. Arrowheads indicate gaps between 
the midbody matrix and the plasma membrane (n > 23 midbodies). 

c, MgcRacGAP knockout DT40 cells expressing a mouse MgcRacGAP rescue 
transgene from tetracycline-inducible promoter. Constitutively expressed 
mCherry-Flag-tagged chicken MgcRacGAP transgenes were introduced into 
the conditional knockout cells. Multi-nucleation in mCherry-positive cells in 
monoclonal lines. Error bars, s.d. of three experiments (nm > 200 cells each). 

d, Model showing how centralspindlin could tether microtubules to the plasma 
membrane. Scale bars, 10 tm. 
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and the plasma membrane in MgcRacGAP C1 domain mutant alleles 
but not wild-type cells (Fig. 4b and Supplementary Fig. 9b). These 
results suggest that the interaction of MgcRacGAP’s Cl domain with 
the plasma membrane promotes the anchoring of the cell envelope to 
the midbody. 

To extend our observations to another species, we used a con- 
ditional knockout system for MgcRacGAP in chicken DT40 lympho- 
cytes**. Mutation of conserved structural (Phe288Gly) and surface 
(Lys291Leu) residues in the C1 domain of chicken MgcRacGAP lead 
to severe cytokinesis failure in DT40 cells (Fig. 4c and Supplementary 
Fig. 9c). Homologous mutations abolished Cl domain membrane 
binding and cytokinesis in human cells (Figs 1 and 2), suggesting that 
the link between centralspindlin and the plasma membrane is a con- 
served and essential feature of cell division in animal cells. 

The midbody matrix has been proposed to act as a ‘glue’ that may 
hold microtubule bundles and the plasma membrane together’. 
Experiments in Drosophila cells indicated that structures other than 
those derived from the contractile ring are responsible for anchoring 
the cleavage furrow to the midbody~*. Our data suggest that coopera- 
tive binding of centralspindlin’s C1 domain to the plasma membrane 
can provide an essential link between midzone-midbody microtubule 
arrays and the cell envelope during cytokinesis in animal cells (Fig. 4d). 
This interaction promotes cortical contractility and the anchoring of 
the cleavage furrow to the midbody. The centralspindlin-membrane 
connection is likely to act synergistically with other processes to main- 
tain the long-term stability of the intercellular bridge'*”°**. These 
mechanisms allow the abscission reaction to sever the intercellular 
bridge long after cleavage-furrow ingression and help to prevent the 
catastrophic consequences of cytokinesis failure”. 


METHODS SUMMARY 

Monoclonal cell lines. Flag~AcGFP-tagged (FLAc; AcGFP is a variant of green 
fluorescent protein cloned from Aequorea coerelescens) variants of MgcRacGAP 
were inserted into pIRESpuro3 (Clontech). Plasmids were transfected into HeLa 
Kyoto cells using FuGENE 6 (Roche). Cells were grown in medium supplemented 
with 0.4 1g ml’ puromycin (Sigma) to select for and maintain monoclonal trans- 
genic cell lines. For all alleles used here, the results obtained in monoclonal cells 
lines were validated using independently isolated pools of transgenic cells (data not 
shown). To generate cell lines co-expressing pIRESneo3-based MyrPalm-mCherry, 
the medium was further supplemented with 500 pg ml” ' G418 (Sigma). 

siRNA transfection. The following siRNA duplexes were used at a final concen- 
tration of 20 nM: control (Invitrogen Stealth 12935-200), MgcRacGAP (Invitrogen 
Stealth HSS120934) (5’'-GCCAAGAACUGAGACAGACAGUGUG-3’), CEP55 
(Invitrogen Stealth HSS182896) (5’-ACGAAUUGCUGAACUUGAAAGC 
AAA-3’). Lipofectamine RNAiMAX (Invitrogen) was used for siRNA transfec- 
tion. Cells were analysed 48 h after transfection with siRNA duplexes. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Protein purification for use in X-ray crystallography. GST-Cl (human 
MgcRacGAP residues 284 to 338) was expressed in Escherichia coli (BL21 
(DE3) codon plus RIL) in the presence of 101M ZnCl, and purified using 
Glutathione Sepharose 4 Fast Flow (GE Healthcare). The Cldomain of 
MgcRacGAP was released from GST by digestion with human rhinovirus 14 3C 
protease and bound to a 1-ml HiTrap SP Sepharose column (GE Healthcare). The 
protein, eluted with a linear 50-500-mM NaCl gradient in 25mM Tris-HCl, 
pH 7.2 (Supplementary Fig. 2a), was dialysed overnight against 250 mM NaCl, 
1mM DTT (dithiothreitol), 50 uM ZnCl, 25 mM Tris-HCl, pH 7.2, and concen- 
trated to 8-9 mg ml! using an Amicon Ultracel-3k device (Millipore). 
Crystallization and structure determination. Crystals of MgcRacGAP’s 
Cl domain were grown at 18°C by vapour diffusion in hanging drops of 1 pl 
protein plus 1 pl reservoir solution containing 1.3 M sodium citrate, pH 7.1, and 
0.3M_ dimethylethylammonium propane sulfonate (NDSB-195, Hampton 
Research). Crystals, briefly soaked in 10% glycerol, 1.3 M sodium citrate, pH 7.1, 
were cryo-cooled by rapid immersion into liquid nitrogen. X-ray diffraction was 
measured at 100 K on 102 beamline of the Diamond Light Source. A severe 
intensity modulation present in the data is explained by pseudotranslation 
(two-thirds of the unit cell were related by a translational non-crystallographic 
symmetry). To measure intensities of the weak reflections with greater precision, 
highly redundant data were collected. The intensities, integrated using XDS” were 
scaled and merged in Scala*’. Although the data could be processed assuming 
space group C222, with unit cell dimensions of a= 36.85 A, b= 107.15 A, 
c = 133.84 A, analysis of Padilla and Yeates L-test plots** provided strong evidence 
for twinning. Hence, the data were processed in the lower symmetry group P2, 
(Supplementary Table 1). The free R flags were selected using 5% of the data in 
Phenix”, taking into account lattice symmetry. The structure was initially solved 
by a combination of single-wavelength anomalous dispersion and molecular 
replacement, using a data set acquired at an X-ray energy of 9.663 keV (wavelength 
1.283 A), corresponding to the K-absorption edge of Zn (Supplementary Table 1). 
The molecular replacement model was generated using Phenix Scupltor’’ based on 
the crystal structure of the protein kinase Cd Cl domain (Protein Data Bank 
(PDB) accession 1PTQ). Positions of Zn atoms were located using SHELXD™, 
and the substructure was refined in SHARP”. The resulting map, improved by 
solvent flipping in Solomon**, showed density for six copies of C1 domain per 
asymmetric unit. Four of the copies were fitted using Molrep”’, and the remaining 
two were fitted manually in Coot**. The structure was refined in Phenix taking 
into account the pseudomerohedral twin law h,-k,-h-l, twin fraction 50%. 
Experimental phase restraints were used at the initial stages of refinement. 
Following iterative model improvement in Coot and least squares refinement in 
Phenix, the model was refined to a resolution of 2.2 A with an R of 21.03% and Rie 
of 23.55% and good geometry (Supplementary Table 1). Examples of the final 
weighted 2F,-F- electron density map are shown in Supplementary Fig. 2b. The 
coordinates and structure factors have been deposited with the PDB under acces- 
sion code 4B6D. 

Plasmids for expression in human cells. To create pIRESpuro3-FLAc, a single 
Flag epitope (DYKDDDDK) and AcGFP (AcGFP, Aequorea coerulescens green 
fluorescent protein; Clontech) were inserted into pIRESpuro3 (Clontech) using 
EcoRI and BamHI. MgcRacGAP variants (NM_013277) were inserted into 
pIRESpuro3-FLAc using Agel and EcoRI. To create an siRNA-resistant allele of 
MgcRacGAP, the sequence AAGAACTGAGACAGACAGT was changed to 
CCGGACCGAGACCGATTCC by introducing silent mutations. MgcRacGAP 
variants were created using polymerase chain reaction (PCR) and site-directed 
mutagenesis. To create MgcRacGAPr-C1B, the residues 287 to 335 of 
MgcRacGAP were replaced by the C1B domain of human PKCa (amino acids 
102 to 151) (provided by P. Parker). To create the MgcRacGAP-AC1-GEM, a 
DNA fragment encoding the 33 C-terminal amino acids of human GEM 
(RKARRFWGKIVAKNNKNMAFKLKSKSCHDLSVL) was inserted after 
MgcRacGAP using EcoRI. The sequence MGCIKSKGKDS was inserted at the N 
terminus of Flag-mCherry to create MyrPalm-Flag-mCherry in pIRESneo3 
(Clontech). H2B-mCherry (Addgene plasmid no. 21044)” and mCherry- 
UtrCH (Addgene plasmid no. 26740)*° transgenes were obtained from 
Addgene. To create C1-FLAc, MgcRacGAP’s Cl domain (MgcRacGAP residues 
281 to 341), was inserted into pIRESpuro3-FLAc using Agel and EcoRI. To create 
tandem Cl2,-FLAc, an additional domain was inserted into pIRESpuro3-Cl- 
FLAc using Nhel and Agel. To create Cl4,-FLAc, tandem Cl domains were 
inserted into pIRESpuro3-Cl5,-FLAc using EcoRV and Nhel. To create C1B- 
FLAc, a single C1B domain of human PKCa (PKC residues 99 to 154) was 
inserted into pIRESpuro3-FLAc using Agel and EcoRI. For transient expression 
experiments in Fig. 2a-d and Supplementary Fig. 6b, HeLa Kyoto and HEK293T 
cells were transfected using FuaGENE HD (Promega) and analysed 24h to 48h 
post transfection. 


Monoclonal cell lines. Flag~AcGFP-tagged (FLAc) variants of MgcRacGAP were 
inserted into pIRESpuro3 (Clontech). Plasmids were transfected into HeLa Kyoto 
cells using FuGENE 6 (Roche). Cells were grown in medium"! supplemented with 
0.4 1g ml’ puromycin (Sigma) to select for and maintain monoclonal transgenic 
cell lines. For all alleles used here, the results obtained in monoclonal cells lines 
were validated using independently isolated pools of transgenic cells (data not 
shown). To generate cell lines co-expressing pIRESneo3-based MyrPalm- 
mCherry the medium was further supplemented with 500 pg ml ' G418 (Sigma). 
siRNA transfection. The following siRNA duplexes were used at a final concen- 
tration of 20 nM: control (Invitrogen Stealth 12935-200), MgcRacGAP (Invitrogen 
Stealth HSS120934) (GCCAAGAACUGAGACAGACAGUGUG), CEP55 (Invi- 
trogen Stealth HSS182896) (ACGAAUUGCUGAACUUGAAAGCAAA). Lipo- 
fectamine RNAiMAX (Invitrogen) was used for siRNA transfection. Cells were 
analysed 48 h after transfection with siRNA duplexes. 

Immunofluorescence microscopy. Cells were fixed for 16 h in —20°C methanol, 
for 15 min at 25°C in 4% formaldehyde (Fig. 4c and Supplementary Fig. 9c) or for 
15 min on ice in 10% tricholoroacetic acid (Supplementary Fig. 8b) before being 
processed for immunofluorescence microscopy as described’. Images for Figs 1c, 
d, 3d and 4c and Supplementary Figs 4b, c, 7 and 9c were acquired on a Zeiss Axio 
Imager M2 microscope using a Plan-Neofluar X40/1.3 numerical aperture (NA) 
oil objective lens (Zeiss) or Plan Apochromat X63/1.4 NA oil objective lens (Zeiss) 
equipped with an ORCA-ER camera (Hamamatsu) and controlled by Volocity 
6.0.1. software (Improvision). Images in Figs 1d and 4c and Supplementary 
Fig. 4b,c, 9c are displayed as maximum-intensity projections of deconvolved Z 
planes that were acquired in 100-nm sections. Images in Figs 1d and 4c and Sup- 
plementary Figs 4b, c and 9c were deconvolved using Volocity’s iterative restora- 
tion function. Images for Supplementary Fig. 8b, c were acquired on an Olympus 
FV1000D (InvertedMicroscopeIX81) laser confocal scanning microscope using a 
PlanApoN 60/1.40 NA Oil Sc objective lens controlled by FV10-ASW software. 
Antibodies and dyes. The following primary antibodies were used: mouse mono- 
clonal anti- AcGFP (JL8, Clontech, immunofluorescence microscopy 1:2,000; blot 
1:1,000), rabbit anti- AcGFP (Clontech, immunofluorescence microscopy 1:2,000), 
mouse monoclonal anti-c-tubulin (B512, Sigma, immunofluorescence micro- 
scopy 1:10,000, blot 1:30,000), rat monoclonal anti-x-tubulin (MCA78G, AbD 
Serotec, immunofluorescence microscopy 1:1,000), rabbit anti-anillin (provided 
by M. Glotzer, immunofluorescence microscopy 1:2,000)**, mouse monoclonal 
anti-Aurora B (AIM-1, BD Transduction Laboratories, immunofluorescence 
microscopy 1:500), mouse polyclonal anti-CEP55 (B01P, MaxPab, immunofluor- 
escence microscopy 1:500), rabbit anti-ECT2 (raised against ECT2 amino acids 1 
to 421, immunofluorescence microscopy purified serum 1:700)'*, mouse mono- 
clonal anti-GAPDH (6C5, Abcam, blot 1:30,000), rabbit anti-GST (provided by J. 
Gannon, blot 1:1,000), rabbit anti-lamin B (ab16048, Abcam, immunofluore- 
scence microscopy 1:500), mouse monoclonal anti-mCherry (Clontech, immuno- 
fluorescence microscopy 1:1,000; blot 1:300), mouse monoclonal anti- MKLP1 (24, 
BD Biosciences, blot 1:1,000), rabbit anti- MKLP1 (N-19, Santa Cruz, immuno- 
fluorescence microscopy 1:500), mouse monoclonal anti-MgcRacGAP (M01, 
Abnova, blot 1:50,000), mouse monoclonal anti-PLK1 (F-8, Santa Cruz, immuno- 
fluorescence microscopy 1:100), rabbit polyclonal anti-PRC1 (H-70, Santa Cruz, 
immunofluorescence microscopy 1:500) and mouse monoclonal anti-RhoA 
(26C4, Santa Cruz, immunofluorescence microscopy 1:75). Secondary antibodies 
conjugated to Alexa Fluor 488, 568, 594 or 647 (Molecular Probes) were used for 
immunofluorescence microscopy detection. DNA was stained with 1 pg ml! 
4',6-diamidino-2-phenylindole (DAPI; Molecular Probes) and with 0.2 pg ml! 
Hoechst 33342 (Molecular Probes) in live cells. 

Co-immunoprecipiation. Thirty-four hours after transfection with MgcRacGAP 
siRNA, HeLa Kyoto cells stably expressing MgcRacGAP-FLAc transgenes were 
arrested in 50ngml * nocodazole for 14h. Cells were lysed in buffer containing 
20 mM Tris, pH 7.5, 150 mM NaCl, 0.1% Triton X-100, 5 mM MgCl anda protease 
inhibitor cocktail (Roche). Proteins were collected using anti-Flag M2 antibody- 
conjugated agarose beads (Sigma) and analysed by immunoblotting. 

Live-cell imaging. Before recording of live cells, the medium was changed to 
phenol-red-free CO -independent medium’. Images in Supplementary Fig. 8a 
and Fig. 3b were acquired at 37°C using a Zeiss Axio Observer Z1 microscope 
controlled by SimplePCI software (Hamamatsu) and equipped with an Orca 
03GO1 camera (Hamamatsu) and a Plan-Apochromat X10/0.45 NA objective. 
The images for Supplementary Fig. 6b were acquired on the Axio Observer Z1 
system with an EC Plan-Neofluor X40/1.30 NA oil DICII objective. For Sup- 
plementary Video 1, Fig. 2a-d and Supplementary Figs 5a and 6b, d, images were 
acquired at 37°C using a Olympus FV1000D (InvertedMicroscopeIX81) laser 
confocal scanning microscope equipped with a PlanApoN X60/1.40 NA Oil Sc 
objective lens and controlled by FV10-ASW software (Olympus). For Fig. 4a and 
Supplementary Video 2, frames were acquired at 37°C using a PerkinElmer ERS 
Spinning disc system equipped with a Nikon TE2000 microscope, Apo TIRF X 100/ 
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1.49 NA (Nikon), a CSU22 spinning disc scanner (Yokogawa), an IEEE1394 Digital 
CCD C4742-80-12AG camera (Hamamatsu) and controlled by Volocity 5.5.1 soft- 
ware (Perkin Elmer). 

Image quantification. For Fig. 2c, d ImageJ software (http://rsbweb.nih.gov/ij/) 
was used to measure mean AcGFP intensities by averaging 6 manually placed 
circular regions of 9 pixels at the cell periphery and 2 circular regions of 1,000 to 
2,000 pixels in the cytoplasm followed by subtraction of the mean background 
signal outside of the cell. For Supplementary Fig. 8b, c, FV10-ASW software was 
used to quantify 16-bit images along the periphery of anaphase cells (n > 10). To 
measure the rate of cleavage-furrow ingression (Supplementary Fig. 8d), differential 
interference contrast (DIC) images of cells were recorded every 30s. Cleavage- 
furrow ingression speed was calculated by measuring the cell width at the equatorial 
plane using ImageJ. Values were normalized to the cell width at metaphase. 
Lipid-array binding assay. GST-C1 (human MgcRacGAP residues 284 to 338) 
fusions were expressed in E. coli (BL21) in the presence of 5 uM ZnCl, and purified 
using Glutathione Sepharose 4 Fast Flow (GE Healthcare). Membrane lipid arrays 
(P-6002, Echelon Biosciences) were incubated with 70 ng ml ' GST-C1 fusion 
proteins or 50ng ml ' GST-GRP1-PH domain (G-3901, Echelon Biosciences) 
in PBS containing 0.1% Tween-20 and 3% BSA for 1h at 25°C. After washing, 
proteins were detected using anti-GST antibodies. 

Manipulation of lipids in vivo. HEK293T cells were treated with 1 mM CaCl, 
and 104M ionomycin (Sigma) in Opti-MEM (Invitrogen). The rapamycin- 
controlled membrane targeting system of lipid phosphatases is described in 
detail in ref. 20. HEK293T cells were transfected with pIRESpuro3-Cl,,-FLAc, 
pmCherry-N1-Lyn11-FRB and one of four different lipid phosphatases (pmRFP- 
C1-FKBP-PJ, pmRFP-C1-FKBP-PJ-Sac, pmRFP-C1-FKBP-INPP5E or pmRFP- 
C1-FKBP-PJ-dead) with a ratio of 6:1:1. Rapamycin (10 1M; Sigma) was applied to 
cells in Opti- MEM. 

Surface plasmon resonance. Surface plasmon resonance (SPR) (Fig. 2e) was carried 
out using the Bio-Rad Protein Interaction Array System ProteOn XPR36. Phos- 
phatidylethanolamine (95%) in chloroform was mixed with L-o-phosphatidyl-1, 
2-dipalmitoyl-phosphoinositides PtdIns(4)P or PtdIns(4,5)P2 (5%) as indicated. 
Chloroform mixtures were dried, resuspended in 10 mM Tris, pH 7.4, and sonicated 
into solution. Equal levels of sonicated lipid solutions were loaded, as determined by 
changes in SPR response units, onto a GLM sensor chip (Bio-Rad) derivatized with 
undecylamine to act as a vesicle capture tether. Recombinant purified GST-tagged 
proteins (Supplementary Fig. 5c) were diluted in PBS to a concentration of 250 nM 
and analyte injections were carried out at a flow rate of 50 il min! and with a sensor- 
chip temperature of 25°C. Resonance units for SPR signals are presented after 
subtraction of the resonance units observed in the phosphatiylethanolamine-only 
vesicle-reference channel. 

Prediction of C1 domain-membrane interaction. The interaction of MgcRacGAP’s 
C1 domain and a lipid bilayer was predicted using the atomic coordinates of the C1 
domain (PDB accession 4B6D) and the Positioning of Integral and Peripheral 
Proteins in Membranes (PPM) server?! 

Phorbol ester treatment. For the experiment shown in Fig. 3a, b cells were 
transfected with MgcRacGAP siRNA and arrested in metaphase using the pro- 
teasomal inhibitor MG132 (10 1M; Sigma) for 2h, then synchronously released 
into anaphase as described". Sixty-five minutes after release from MG132, cells 
were treated with 2.5nM TPA (Sigma) or 30nM PDBu (Sigma). 

Electron microscopy. Cells expressing FLAc-tagged variants of human 
MgcRacGAP were plated on 13-mm cover glasses (Gerhard Menzel, Thermo 
Scientific) coated with 0.01% poly-t-lysine solution (SigmaAldrich). After transfec- 
tion with MgcRacGAP, siRNA cells were synchronized at anaphase as described". 
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Cells were fixed 100 min after release from metaphase in 1.5% glutaraldehyde and 
2% paraformaldehyde in 0.1 M phosphate buffer, pH 7.4, for 45 min. The samples 
were post-fixed in reduced osmium tetroxide, stained with tannic acid, dehydrated 
step-wise to 100% ethanol and embedded in epon. Ultrathin sections of 80 nm were 
cut, post-stained with lead citrate and viewed in a Tecnai Spirit Biotwin 120 keV 
transmission electron microscope (FEI Company). Images were captured using an 
Orius CCD (Gatan) with Digital Micrograph software (Gatan). 

DT40 cell methods. Chicken MgcRacGAP (XM_424490) was amplified from 
total DT40 RNA (provided by H. Hochegger) using the OneStep RT-PCR Kit 
(QIAGEN). Point mutations were introduced using site directed mutagenesis. 
Chicken MgcRacGAP variants were inserted into pmCherry-Flag using Agel 
and EcoRI sites. Parental and targeted DT40 cells” (provided by M. Hikida and 
M. Kurosaki) were grown in RPMI1640 (Invitrogen) supplemented with 50 uM 
2-mercaptoethanol (Sigma), 10% fetal bovine serum (Sigma), 1% chicken serum 
(Sigma), 2 mM L-glutamine and antibiotics (Invitrogen) at 37 °C in 5% CO>. The 
medium was supplemented with 100ngml ' tetracycline (Sigma) to maintain 
expression of the Flag-tagged mouse MgcRacGAP rescue gene in the conditional 
knockout cell line. pmCherry-Flag-based plasmids were introduced into DT40 
cells using a Bio-Rad Gene Pulser Xcell Electroporation system (settings: 300 V, 
950 WF). Puromycin resistance was introduced by co-electroporating cell with the 
plasmid pLoxPuro™. pmCherry-Flag-based plasmids and pLoxPuro plasmid were 
linearized before electroporation by EagI and Pvul, respectively. To select for and 
maintain monoclonal cell lines expressing pmCherry-Flag transgenes the medium 
was supplemented with 0.3 pg ml! puromycin (Sigma). Cells were analysed 48 h 
after removal of tetracycline. 
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Regulation of ISWI involves inhibitory modules 
antagonized by nucleosomal epitopes 


Cedric R. Clapier' & Bradley R. Cairns! 


Chromatin-remodelling complexes (CRCs) mobilize nucleosomes 
to mediate the access of DNA-binding factors to their sites in vivo. 
These CRCs contain a catalytic subunit that bears an ATPase/ 
DNA-translocase domain and flanking regions that bind nucleoso- 
mal epitopes’. A central question is whether and how these flanking 
regions regulate ATP hydrolysis or the coupling of hydrolysis to 
DNA translocation, to affect nucleosome-sliding efficiency. ISWI- 
family CRCs contain the protein ISWI’, which uses its ATPase/ 
DNA-translocase domain to pump DNA around the histone octa- 
mer to enable sliding*’. ISWI is positively regulated by two ‘acti- 
vating’ nucleosomal epitopes: the ‘basic patch’ on the histone H4 
tail, and extranucleosomal (linker) DNA*°. Previous work defined 
the HAND-SANT-SLIDE (HSS) domain at the ISWI carboxy 
terminus that binds linker DNA, needed for ISWI activity'*’>. 
Here we define two new, conserved and separate regulatory regions 
on Drosophila ISWI, termed AutoN and NegC, which negatively 
regulate ATP hydrolysis (AutoN) or the coupling of ATP hydrolysis 
to productive DNA translocation (NegC). The two aforementioned 
nucleosomal epitopes promote remodelling indirectly by prevent- 
ing the negative regulation of AutoN and NegC. Notably, mutation 
or removal of AutoN and NegC enables marked nucleosome sliding 
without the H4 basic patch or extranucleosomal DNA, or the HSS 
domain, conferring on ISWI the biochemical attributes normally 
associated with SWI/SNF-family ATPases. Thus, the ISWI ATPase 
catalytic core is an intrinsically active DNA translocase that con- 
ducts nucleosome sliding, onto which selective ‘inhibition-of- 
inhibition’ modules are placed, to help ensure that remodelling 
occurs only in the presence of proper nucleosomal epitopes. This 
supports a general concept for the specialization of chromatin- 
remodelling ATPases, in which specific regulatory modules adapt 
an ancient active DNA translocase to conduct particular tasks only 
on the appropriate chromatin landscape. 

ISWLis the catalytic subunit of a set of CRCs, with orthologues in all 
eukaryotes’ (Fig. 1). To define how ISWI is intrinsically regulated, we 
undertook an extensive structure-function analysis of the Drosophila 
ISWI ATPase, and its regulation by the histone H4 tail and extranu- 
cleosomal DNA. In the amino terminus of ISWI, we recognized a 
region similar to the ‘basic patch’"® of the histone H4 tail (Fig. 1a) that 
is conserved in eukaryotes (Fig. 1b). To test function, we replaced 
either (R91A or R93A) or both (2RA) arginine residues with alanines. 
Nucleosome sliding by wild-type ISWI normally requires nucleosomes 
with intact H4 tails*’ and the presence of extranucleosomal DNA'”’; 
however, wild-type ISWI ATPase activity can be elicited by combining 
DNA and H4 tail peptide bearing the basic patch (Fig. 1c)'°. Notably, 
the ISWI(2RA) derivative showed ~threefold higher ATPase activity 
than wild-type ISWI and was independent of H4 epitopes (Fig. 1c). For 
extended nucleosomes (200 base pairs (bp), synthetic centrally posi- 
tioned sequence (601)), loss of either the basic patch (Ala 17, Ala 19) or 
the H4 tail (globular H4 (g4)) markedly diminished (fivefold) wild- 
type ISWI activity, whereas ISWI(2RA) was only modestly attenuated, 
retaining ~63% activity and ~ninefold higher activity than wild-type 
ISWI (Fig. 1d). Notably, ISWI(R91A) and ISWI(R93A) resemble 


moderate and weak gain-of-function derivatives, respectively; as their 
ATPase activities increase, their dependence on the H4 tail decreases 
(Supplementary Fig. 1). Thus, ISWI(2RA) greatly activates ATPase 
activity, and renders the enzyme largely independent of the H4 tail, 
defining an N-terminal autoinhibitory region (AutoN) with similarity 
to the H4 tail. 

Remodeller ATPases are SF2-family'® DNA translocases that couple 
DNA translocation to ATP hydrolysis*”'”-’, wherein the two RecA-like 
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Figure 1 | AutoN resembles the histone H4 tail basic patch and restricts 
ISWI ATPase activity. a, Alignment of Drosophila ISWI and the histone H4 
N terminus, with the basic patch depicted. b, ISWI protein and domains/regions, 
including ATPase, HSS and new functional regions (AutoN and NegC), with 
zoom of AutoN alignments for ISWI orthologues in Drosophila (Dm), Xenopus 
(X2), human and S. cerevisiae (Sc). The human orthologues SNF2H and SNF2L1 
are also known as SMARCAS5 and SMARCAlI, respectively. ¢, Mutation of 
AutoN (2RA) hyperactivates ISWI ATPase activity and bypasses H4 tail 
stimulation by peptides. Values are mean of =3 experiments, normalized to 
wild-type ISWI with DNA. Error bars denote + s.e.m. d, AutoN (2RA) mutation 
stimulates ISWI ATPase activity with mononucleosomes (nuc): native (WT), 
H4 tail-less/globular (g4), or containing basic-patch mutations (Ala 17, Ala 19). 
Numbers (147, 200) denote length (in bp) of DNA fragment. Colour code 
reflects nucleosome-sliding activity (in Fig. 2c, d). Values are mean of =3 
experiments, normalized to wild-type ISWI. Error bars denote + s.e.m. 
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protein lobes of the ATPase domain sequentially grip and release the 
DNA backbone, typically moving a small number of base pairs per ATP 
hydrolysis". ‘Coupling’ refers to how efficiently ATP hydrolysis results 
in productive DNA translocation. Uncoupling by mutation is well docu- 
mented”’”’, but of unknown relevance for regulation. The key question is 
whether and how coupling is regulated by nucleosomal epitopes. To 
monitor coupling, we used a ‘tethered translocation’ system, which uses 
plasmid supercoiling to quantify DNA translocation by ISWI (Sup- 
plementary Fig. 2). Notably, DNA translocation by wild-type ISWI 
was greatly enhanced by H4 tail peptide, whereas ISWI(2RA) showed 
robust DNA translocation without peptide (Fig. 2a), consistent with our 
ATPase results (Fig. Ic, d). 

Nucleosome sliding was monitored by achieving an equilibrium 
between end- and centre-positioned octamers, from an initial centre- 
positioned population (Fig. 2b). ISWI(2RA) yielded an end¢+centre 
equilibrium much faster than wild-type ISWI; ISWI(2RA) shifted in 
30s the amount shifted in 30 min by wild-type ISWI (Fig. 2b). Wild- 
type ISWI required the full H4 tail for in vitro sliding (shown prev- 
iously)®, and here we establish the importance of the basic patch 
(Fig. 2c, top panels). Notably, ISWI(2RA) partially bypassed mutations 
in the H4 basic patch, enabling sliding, but only weakly bypassed the 
full H4 tail (Fig. 2c, bottom panels). This suggested an additional H4 
epitope (beyond the basic patch) needed for coupling high ATPase 
activity to efficient sliding, and an extensive set of experiments 
involving peptides and nucleosomes bearing mutations in the H4 tail 
revealed a major contribution from H4 lysine 12 (H4K12) in promot- 
ing coupling (Supplementary Fig. 3). 

We next examined core nucleosomes (147 bp, 601 sequence). 
Supporting previous work"', wild-type ISWI activity required both 
extranucleosomal DNA and H4 tail epitopes (Figs 1d and 2d). 
Interestingly, with ISWI(2RA), both ATPase and sliding remained 
efficient on core nucleosomes (Figs 1d and 2d), establishing inde- 
pendence from extranucleosomal DNA. However, omitting the H4 
tail greatly reduced sliding but not ATPase activity with ISWI(2RA) 
(Figs 1d and 2d). Taken together, AutoN functions as an autoinhi- 
bitory region that, when mutated (in ISWI(2RA)), largely bypasses 
the H4 tail for ATPase stimulation, but retains reliance on the H4 
tail to ‘couple’ that ATP hydrolysis to productive DNA translocation. 
One simple model is that AutoN binds to and holds the ATPase 
domain in an inactive conformation, with the H4 tail competing 
for part of that surface; the binding of the H4 tail displaces AutoN 
and enables a conformational change that activates ISWI ATPase 
activity, an effect mimicked by the ISWI(2RA) mutation. Here, the 
ATPase activity of our derivative series (2RA>R91A>R93A) was the 
reciprocal of H4 tail stimulation (Supplementary Fig. 1), consistent 
with competition/antagonism for a region on ISWI involving the 
Hé4 tail and AutoN. Consistent with this model, a peptide from the 
AutoN region of wild-type ISWI (Lys 76-Glu 98) does not activate 
ISWI (Fig. 1c). 

Previous work defined a HSS domain present in the C terminus of 
ISWI (and the related yeast Chd1 protein) that binds extranucleo- 
somal DNA, needed for ISWI and Chd1 remodelling activity'*”*. 
However, our results with AutoN prompted a search for an analogous 
unknown region that prevents ATPase action unless the HSS is bound 
to extranucleosomal DNA. Multiple approaches (alignments, protease 
mapping, structural modelling) yielded a candidate region (covering 
residues 617 to 648) immediately following the second of the two 
RecA-like ATPase lobes, which we term NegC (Fig. 3a, b). A structural 
model using Phyre? (ref. 25; Fig. 3b) revealed NegC as a “C-terminal 
bridge’ traversing from ATPase lobe 2 back to the ATPase lobe 1, 
crossing the key functional ATPase cleft. Furthermore, this structural 
model of NegC is consistent with crosslinking experiments of ISWI’’, 
and a recent crystal structure of Chd1 (ref. 27). Notably, NegC is 
conserved only in remodellers that bear an HSS domain and are regu- 
lated by extranucleosomal DNA (ISWI and Chd1, not SWI/SNF), 
although NegC function is entirely unknown. 
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nucleosome sliding. a, Comparative DNA-translocation activity (see 
Supplementary Fig. 2) of ISWI derivatives. Translocation generates supercoiled 
topoisomers (SC). b, c, Comparative sliding of extended nucleosomes by ISWI 
derivatives, as a time course (b), and their reliance on H4 tail epitopes, in 
titration series (c). d, Comparative sliding activity on core nucleosomes reveals 
a reliance on H4 tail epitopes. For b and d, enzyme:substrate molar ratio is 1:2. 


To test NegC function, we isolated three ISWI C-terminal trun- 
cation derivatives: AC697 (omitting the HSS), AC648 (further dele- 
tion) and AC617 (omitting NegC), as well as combinations with 
AutoN mutations. First, omitting C-terminal regions from wild-type 
ISWI did not render them stimulatable by naked DNA (Fig. 3c). By 
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Figure 3 | NegC inhibits the coupling of ATP hydrolysis to DNA 
translocation, and nucleosome sliding. a, NegC conservation, revealed in 
zoomed alignment of ISWI orthologues. b, Modellization of ISWI(AC697) 
showing NegC (orange) traversing the cleft (red dashed line) from ATPase lobe 
2 (blue) to ATPase lobe 1 (green). Model generated using Phyre’; depicted with 
PyMol”®. c, The AutoN (2RA) mutation elicits robust DNA-dependent ATPase 
activity independent of the HSS. Values represent mean of three experiments, 


contrast, both full-length ISWI(2RA) and ISWI(2RA/AC) derivatives 
were highly stimulated by DNA (Fig. 3c), showing that the ATPase 
domain itself has an intrinsic DNA-dependent ATPase activity, which 
is held in check by the AutoN region. Notably, removal of the HSS 
domain (AC697 or AC648) prevented DNA translocation (Fig. 3d) 
while retaining high ATPase activity (Fig. 3c, for ISWI(2RA)), dem- 
onstrating a lack of coupling. Notably, further truncation to AC617 
(ISWI(2RA/AC617)) restored DNA translocation (Fig. 3d), estab- 
lishing that NegC negatively regulates the coupling of ATP hydrolysis 
to DNA translocation. 

We then tested these ISWI(AC) derivatives with core or extended 
nucleosomes that either contain or lack the H4 tail (Fig. 3e). As expected", 
omitting the HSS domain in ISWI(AC697) (or ISWI(AC648)) greatly 
reduced sliding of extended nucleosomes (lanes 3, 4, 19, 20). 
Surprisingly, ISWI(2RA/AC697) and ISWI(2RA/AC648) displayed 
high ATPase activity, but completely failed to slide either core (lanes 
23, 24) or extended (lanes 7, 8) nucleosomes, mirroring the un- 
coupling of ATPase activity and DNA translocation above (Fig. 3c, 
d). Remarkably, and in-keeping with our tethered translocation 
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normalized to wild-type ISWI, with 200-bp DNA fragment. Error bars denote 
+ s.e.m. d, Deletion of NegC through truncation restores DNA translocation. 
e, Impact of NegC region on sliding activities of core and extended nucleosomes 
(and H4 tail-less versions). ATPase activities (values, coloured by impact on 
sliding: high (green), moderate/low (orange), absent (red)). Enzyme:substrate 
molar ratio is 1:2. B, buffer. FL, full length. 


results, further removal of NegC (AC617) restored sliding activity, 
without an accompanying increase in ATPase activity. Restora- 
tion was moderate with ISWI(AC617) (lane 2) and efficient with 
ISWI(2RA/AC617) (lane 6) on extended nucleosomes. Similarly, with 
core nucleosomes, sliding was low but detectable with ISWI(AC617) 
(lane 18), and efficient with ISWI(2RA/AC617) (lane 22). Thus, NegC 
is a new region that negatively regulates sliding through ATPase/ 
translocation uncoupling. NegC is entirely separate from, and oppo- 
site in function to, characterized regions in other ATPases that pro- 
mote sliding””®. 

We then addressed how ISWI integrates the presence of nucleo- 
somal epitopes. For wild-type ISWI, omission of extranucleosomal 
DNA (lane 17) greatly impaired sliding, whereas omission of the 
Hé4 tail (lane 9, or omitting both, lane 25) prevented sliding. With 
ISWI(2RA), loss of both the H4 tail and extranucleosomal DNA 
prevented sliding (lane 29), even though ATPase activity was high; 
comparable to wild-type ISWI on extended nucleosomes (lane 1). 
However, ISWI(2RA) could slide extended nucleosomes lacking the 
H4 tail (lane 13), or core nucleosomes bearing H4 tails (lane 21). Thus, 
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both the H4 tail and extranucleosomal DNA contribute to coupling. 
Importantly, whereas ISWI(2RA/AC697) and ISWI(2RA/AC648) are 
unable to slide nucleosomes lacking the H4 tail (g4; lanes 15, 16, 31, 
32), removal of NegC (ISWI(2RA/AC617)) restored relatively efficient 
sliding of extended g4 nucleosomes (lane 14), and moderate sliding of 
core g4 nucleosomes (lane 30). Furthermore, removal of NegC from 
ISWI(2RA) restores sliding ability with nucleosomes bearing amino- 
acid substitutions in both H4K12 and the basic patch (Supplementary 
Fig. 4). Thus, with ISWI(2RA), NegC omission enables sliding of nucleo- 
somes without a basic patch or extranucleosomal DNA (Fig. 3e, lanes 6, 
14, 22, 30); properties that normally define SWI/SNF-family remodellers. 

To examine the effects of AutoN and NegC in vivo, we expressed (in 
Saccharomyces cerevisiae) mutations/truncations in Iswl equivalent to 
those from our ISWI biochemistry (see Figs 1b and 3a) and tested 
complementation (Supplementary Fig. 5) or dominant phenotypes 
(using a sensitized rsc7A (also known as NPL6) background, synthe- 
tically lethal with a wide range of chromatin mutants”). Interestingly, 
expression of an Iswl derivative lacking both AutoN and NegC 
regulation prevented growth (Fig. 4a), consistent with rogue chro- 
matin-misregulation activity (although loss of proper regulation by 
Isw1-associated proteins may also contribute). 

Taken together, our work provides several conceptual advances that 
affect both ISWI regulation and mechanism (Fig. 4b). Regarding regu- 
lation, the prior model involved ‘positive’ regulation of ISWI by the 
H4 basic patch and by extranucleosomal DNA. Here, we replace that 
conception with an inhibition-of-inhibition model, whereby these two 
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Figure 4 | Expression of ISWI derivatives in vivo, and regulation model for 
ISWI. a, Iswl lacking AutoN and NegC regulation greatly impairs growth of a 
S. cerevisiae strain sensitized for chromatin misregulation (rsc7A). Isw1 
derivatives expressed in an isw1A rsc7A strain bearing a URA3-marked RSC7* 
plasmid; growth on 5-FOA media enforces the loss of the plasmid, imposing an 
rsc7A genotype. SC-Leu, synthetic complete growth medium lacking leucine. 
b, Logic of ISWI ATPase regulation. The ISWI ATPase activity is positively 
regulated by DNA in the ATPase cleft. ISW1 is also negatively regulated by two 
intrinsic domains: AutoN and NegC. AutoN inhibits the ATP-hydrolysis rate, 
and is relieved by the basic patch (R17-R19) of the H4 tail, whereas NegC 
inhibits ATPase coupling to DNA translocation, and is relieved by the HSS 
domain binding to sufficient extranucleosomal DNA, defining the ‘inhibition 
of inhibition’ mode of ISWI regulation (note: an additional lysine (H4K12) also 
promotes coupling.) 
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nucleosomal epitopes function to relieve an intrinsic autoinhibition 
conferred by two new negative regulatory domains, AutoN and NegC. 
Notably, whereas AutoN functions primarily to inhibit ATPase acti- 
vity, NegC functions to inhibit ATPase coupling to DNA transloca- 
tion, with structural models for NegC supporting this function. By this 
model, interaction of the HSS domain with extranucleosomal DNA 
does not activate the ATPase per se, it relieves NegC and restores 
ATPase coupling. Regarding mechanism, a key current model involves 
the HSS pushing extranucleosomal DNA into the nucleosome, pro- 
viding the initial mechanical power stroke to forma translocated DNA 
loop on the surface of the nucleosome. However, as we observe effi- 
cient DNA translocation and nucleosome sliding following deletion of 
the HSS (when combined with NegC omission), the primary function 
of the HSS is not mechanical, but rather to regulate DNA translocation 
by antagonizing NegC, ensuring that coupling occurs only when extra- 
nucleosomal DNA is of sufficient length. 


METHODS SUMMARY 


ISWI protein derivatives were produced as N-terminal fusions to proteins 
coding for trigger factor and the Tet repressor DNA-binding domain (TF-TetR- 
ISWI), which heterodimerizes with an unfused TetR to enable tetO binding 
for DNA-translocation assays. Proteins were expressed in Escherichia coli 
BL21CodonPlus(DE3)RIL, and were purified to homogeneity as monodisperse 
derivatives. Chemically synthesized H4 tail peptides encompass residues Lys 8 to 
Asp 24 and AutoN peptide includes ISWI residues Lys 76 to Glu 98. 

The ATPase assays were performed at 26 °C using a colorimetric assay based on 
the formation ofa complex between inorganic phosphate and molybdate-malachite 
green. Reactions were an addition/omission series involving ISWI (10 pmol), 
pBluescript plasmid (500 ng), peptide (504M), DNA fragments (500 fmol) or 
nucleosomes (500 fmol), as indicated. 

Mononucleosomes used purified recombinant Drosophila histones expressed in 
E. coli BL21-CodonPlus(DE3)RIL, assembled with 147-bp or 200-bp DNA frag- 
ments (isolated from a plasmid by restriction digests, and containing the 601 
strong-positioning sequence) by a salt-dialysis linear gradient. For the 200-bp 
fragment, the 601 sequence was centrally located. 

The DNA-translocation assays measured plasmid supercoils generated by a 
single ISWI protein anchored through its TetR fusion to a previously relaxed 
(by E. coli topoisomerase I) plasmid DNA containing a single tetO operator 
sequence (Supplementary Fig. 2). Deproteinized samples were loaded on a 1.3% 
agarose gel, subsequently stained in ethidium bromide, and scanned ona Typhoon 
Trio (Amersham, GE). 

The nucleosome-sliding assays were performed at 26°C using a 1:2 enzyme: 
substrate molar ratio, except in the titration series. The reactions were stopped by 
adding competitor pBluescript plasmid DNA, and loaded on a 4.5% (37.5:1) native 
polyacrylamide gel, subsequently stained in ethidium bromide, and scanned on a 
Typhoon Trio. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Reagents. ISWI protein derivatives were expressed in Escherichia coli 
BL21CodonPlus(DE3)RIL, upon 0.5 mM isopropyl-f-p-thiogalactoside induc- 
tion at Deoonm ~0.5 for 24h at 15°C, as complexes from two vectors: one— 
pCDFDuet-1—bears a gene coding for TetR-(His)7, and the other—pCold TF 
(Takara) lacking the His tag—is a gene fusion containing TF-3C cleavage site- 
TetR-ISWI-Flag. A sequence of eight amino acids (GGQGGQGG) was inserted 
between the genes coding for TetR and ISWI. The properly assembled complex 
contains a heterodimer of TetR obtained by two successive affinity purifications. 
The bacterial-cell extracts were first mixed with nickel-nitrilotriacetic acid agarose 
resin (Qiagen), capturing the complex and the unwanted TetR homodimer. The 
sample eluted from the resin was then purified using the Flag tag and anti-Flag M2 
affinity gel and eluted with 3 x Flag peptide (Sigma) to obtain the desired complex. 
Then, trigger factor was cleaved from the complex using human rhinovirus 3C 
protease (Novagen) using 1 unit of protease per 0.1 mg of complex by incubating 
16h at 4°C, in the presence of 10mM f-mercaptoethanol and 0.5 mM EDTA. 
Finally, the purification of the complex was achieved by a gel-filtration step on two 
S200GL 10/300 (Amersham, GE) in series. Obtained complexes were homogen 
and monodisperse. 

Chemically synthesized H4 tail peptides encompass residues Lys 8 to Asp 24; 
AutoN peptide includes ISWI residues Lys 76 to Glu 98. 

Mononucleosomes were produced from single recombinant Drosophila his- 

tones expressed in E. coli BL21CodonPlus(DE3)RIL, purified as inclusion bodies, 
and assembled in octamers by salt-dialysis, essentially as described in ref. 31. The 
200- and 147-bp DNA fragments containing the 601 positioning sequence*” were 
respectively produced from plasmids pUC12 X 601 (digested by Aval) and pST55- 
16 X NCP601a (digested by EcoRV). For the 200-bp fragment, the 601 sequence 
was centrally located. The DNA fragments were purified from the backbone 
using preparative electrophoresis (PrepCell, Bio-Rad) 4.5% (37.5:1) native poly- 
acrylamide gel running at 400 V constant in 0.5 X TBE (Tris-borate 45 mM, pH 8, 
with 1mM EDTA), with TE (Tris 10 mM, 1mM EDTA) as elution buffer. 
Mononucleosome assemblies were performed by titration reactions, as 50-ttl reac- 
tion containing 40 pmol of DNA fragment in 2M KCl] mixed with a variable 
amount of histone octamers covering the equimolar ratio, with a linear salt- 
gradient dialysis applied from 2M to 50mM KCL, essentially as described in 
ref. 31, using an Econo-Pump (Bio-Rad) and Slide-A-Lyser Mini dialysis units 
with a 7,000 molecular weight cutoff (Thermo Scientific). 
ATPase assay. Measurement of ATP hydrolysis was based on the formation of a 
complex between inorganic phosphate and molybdate-malachite green. ATPase 
assays were performed in 10 mM HEPES buffer, pH 7.3, 20 mM potassium acetate, 
5mM MgCh, 0.5 mM dithiothreitol, 0.1 mg ml ! BSA and 5% glycerol, in the 
presence of 1mM ATP. After 30-min incubation at 26 °C with 500 r.p.m. shaking 
in a Thermomixer (Eppendorf), 800 pil of MGAM reagent (3 volumes of MG = 
0.045% (w/v) malachite green in 0.1 N HCl, mixed with 1 volume of AM = 4.2% 
(w/v) ammonium molybdate in 4N HCl) were added, followed 1 min later by 100 jl 
of 34% (w/v) Nascitrate. Measures were performed at Dgs50 nm 10 min later. In the 
plasmid DNA and peptide experiments, 10 pmol of ISWI was used with or without 
500 ng of pBluescript plasmid and/or 50 uM peptide. In the mononucleosome or 
DNA-fragment experiments, 10 pmol of ISWI was in the presence of 500 fmol of 
nucleosome or DNA fragment, respectively. 
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DNA-translocation assay. The 20-1] reaction contained 10 pmol ISWI, with or 
without 15 uM peptide, in the presence of 250 ng of previously relaxed plasmid, 
1mM ATP, 2.5 U of topoisomerase I (NEB) in NEB1 1X buffer, 1 mg ml! BSA. 
Experiments were performed at 30°C for 2h (or shorter during time course), 
followed by heat inactivation at 65°C for 20 min. De-proteinization was per- 
formed by adding 2 ul of proteinase K at 10mgml ’ and 11 SDS 20% and 
incubated at 50°C for 1h. Reactions were subsequently precipitated in ethanol, 
prior to loading on a 1.3% agarose gel run for 3h at 130 V. Gels were stained for 
20 min ina 1 pg ml! ethidium bromide solution, and scanned ona Typhoon Trio 
(Amersham, GE). Time-course experiments are performed using a starting 
reaction corresponding to a scale-up of amounts above, proportional to the num- 
ber of desired aliquots. 

Sliding assay. 100 fmol (or variable for titration) ISWI was incubated in 10 mM 
Tris buffer, pH 7.4, 50 mM KCI, 3 mM MgCl, 0.1 mg ml ' BSA, 1mM ATP, in the 
presence of 200 fmol of mononucleosome for 90 min at 26°C, with shaking at 
500 r.p.m. ina Thermomixer (Eppendorf). 10-1 reactions were stopped by adding 
200 ng competitor DNA (pBluescript plasmid) and incubated for an additional 
30 min as previously. Reactions were loaded using glycerol 10% on a 4.5% (37.5:1) 
native polyacrylamide gel and run in 0.4 X TBE for 55 min at 110 V constant. Gel 
was stained for 10 min ina 1 1g ml ! ethidium bromide solution, and scanned on 
a Typhoon Trio (Amersham, GE). Time-course experiments were performed 
using a starting reaction corresponding to a scale-up of amounts above, pro- 
portional to the number of desired aliquots. 

In vivo yeast experiments. Plasmids expressing Isw1 derivatives were generated 
by marker conversion to LEU2 (from S. cerevisiae) of pRS416-Isw1 (ref. 33), 
followed by site-directed mutagenesis (via PCR) generating AAA mutations and 
AC truncations. Final constructs were verified by sequencing. 

Complementation assay: the growth of S. cerevisiae strain TOH1358 lacking 
CHD1 and ISW1 genes is temperature sensitive**. Complementation experiments 
were performed by transforming the strain with plasmids expressing various 
derivatives of Isw1 under control of the endogenous promoter, performing tenfold 
spot dilutions, and observing the rescue of the growth ability at high temperature. 
Plate medium is SC-Leu. 

Testing the dominant-negative phenotype: here, we used a sensitized genetic 
background strain YBC 2233, lacking ISW1 and RSC7. The strain was transformed 
with plasmids expressing separate Isw1 derivatives under control of the endogen- 
ous ISW1 promoter, selecting for colonies on SC-Leu. Growth ability (in spot 
dilution format) was tested at 30°C on SC-Leu plates, which either lacked or 
contained 5-FOA (which enforces the loss of a URA3-marked plasmid bearing 
RSC7). 
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The TEL patch of telomere protein TPP] mediates 
telomerase recruitment and processivity 


Jayakrishnan Nandakumar’?, Caitlin F. Bell)?+*, Ina Weidenfeld!**, Arthur J. Zang, Leslie A. Leinwand!? & Thomas R. Cech!?? 


Human chromosome ends are capped by shelterin, a protein com- 
plex that protects the natural ends from being recognized as sites of 
DNA damage and also regulates the telomere-replicating enzyme, 
telomerase’*. Shelterin includes the heterodimeric POT1-TPP1 
protein, which binds the telomeric single-stranded DNA tail*”. 
TPP1 has been implicated both in recruiting telomerase to telo- 
meres and in stimulating telomerase processivity (the addition of 
multiple DNA repeats after a single primer-binding event)”™*. 
Determining the mechanisms of these activities has been difficult, 
especially because genetic perturbations also tend to affect the 
essential chromosome end-protection function of TPP1 (refs 15-17). 
Here we identify separation-of-function mutants of human TPP1 
that retain full telomere-capping function in vitro and in vivo, yet 
are defective in binding human telomerase. The seven separation- 
of-function mutations map to a patch of amino acids on the surface 
of TPP1, the TEL patch, that both recruits telomerase to telomeres 
and promotes high-processivity DNA synthesis, indicating that 
these two activities are manifestations of the same molecular inter- 
action. Given that the interaction between telomerase and TPP1 is 
required for telomerase function in vivo, the TEL patch of TPP1 
provides a new target for anticancer drug development. 

Genetic analysis of TPP1 regulation of telomerase is complicated 
by the potential of TPP1 mutations to uncap telomeres, which can 
give unnatural telomerase hyperextension or induce a DNA-damage 
response. Our search for separation-of-function mutants of TPP1 that 
would affect only its telomerase interactions was conducted on the 
TPP1 OB domain (TPP1(OB)), which is separate from its POT1 sin- 
gle-stranded DNA (ssDNA)-interaction domain (Fig. la) and prev- 
iously implicated in telomerase interaction’*"*. We engineered 12 
single amino-acid mutants and two double mutants of TPP1(OB), 
choosing residues that are both conserved among mammalian TPP1 
proteins and reside on the protein surface’ (Supplementary Fig. 1a, b). 
All mutant and wild-type proteins were expressed recombinantly in 
Escherichia coli and purified (Supplementary Fig.1c; see Methods). 

The POT1-TPP1 heterodimer has a higher affinity for telomeric 
ssDNA than does POT] alone’, which is consistent with the role of this 
complex in protecting single-stranded telomeric DNA from the DNA- 
damage response machinery’’. Quantitative DNA-binding experiments 
demonstrated that all the mutant proteins, like wild-type TPP1, were 
able to enhance POTI1-DNA affinity (Fig. 1b and Supplementary Fig. 
2a). Furthermore, all TPP1 mutants formed stable POT1-TPP1-ssDNA 
ternary complexes in electrophoretic mobility shift assays (Supplemen- 
tary Fig. 2b). These results suggest that the TPP1(OB) surface mutations 
do not disrupt the overall structure or the DNA-end protection function 
of TPP1. 

Addition of purified POT1-TPP1 to telomerase enzyme activity 
assays enhances telomerase processivity 2-3-fold in vitro’. We 
developed a new direct telomerase assay in which TERT, TR, TPP1 
and POT1 were recombinantly co-expressed in human cells. Lysates 


from these cells were used in direct telomerase activity assays involving 
extension of primer a5 (TTAGGGTTAGCGTTAGGG; the underlined 
G-to-C mutation ensured positioning of the POT1-TPP1 complex at 
the 5’ portion of the DNA primer, thereby providing a homogenous 
substrate for telomerase extension’). The 14 TPP1 mutants varied 
substantially in their stimulation of processivity; some such as S106A, 
S111A and R175V, resulted in long extension products indicative of 
high processivity, similar to wild-type TPP1, but others, such as E169A/ 
E171A and L212A, resulted in low-processivity patterns similar to that 
with no TPP1 added (Fig. 1c). Quantification of data (Fig. 1d) showed 
that several TPP1 mutants gave significantly decreased processivity 
(P< 0.005). Western blot analysis showed that TERT, POT1 and 
TPP1 proteins were expressed at similar levels across all transfections 
(Fig. le). Note that processivity, unlike activity, is not influenced by 
enzyme concentration. 

As an independent test of processivity, lysates from HEK 293T cells 
transiently overexpressing telomerase were supplemented with purified 
POT1 and TPP1 proteins before the telomerase assay, as described 
previously”""'® (Supplementary Fig. 3; see also alternative quantifica- 
tion of processivity in Supplementary Fig. 4). This traditional assay gave 
results in agreement with the new co-transfection method. We also 
analysed the 14 TPP1 mutants using a primer with a purely telomeric 
sequence and a different sequence permutation (GGTTAGGGTTAG 
GGTTAG); we obtained essentially the same results as with primer a5 
(Supplementary Fig. 5), verifying that the TPP1 mutant phenotype was 
not primer specific. All three assays identified the same separation-of- 
function mutants of TPP1 (L104A, D166A/E168A, E169A/E171A, 
R180A, L183A and L212A), with E215A and R208V being inter- 
mediate and the other mutants having wild-type activity. Further dis- 
section of the two double mutants into four single mutants revealed 
that E168, E169 and E171, but not D166, were important for telomerase 
stimulation (Supplementary Fig. 3). Most of the TPP1 mutants that 
were defective in processivity also gave a statistically significant decrease 
in activity (Supplementary Fig. 3b). In agreement with previous results’, 
the stimulation of activity (and its dependence on specific TPP1 resi- 
dues) was observed even in the absence of added POT! protein 
(Supplementary Fig. 6). 

The effects of TPP1 mutations might be explained by disruption 
of a TPP1-telomerase interaction. To test this directly, we developed a 
co-immunoprecipitation assay involving co-transfection of Flag- 
TPP1 and untagged telomerase (TERT plus TR) in HeLa-EM2-11ht 
cells. Inspection and quantification by immunoblot of proteins co- 
precipitated with anti-Flag beads shows that TERT, but not actin, was 
associated with Flag~TPP1 (Fig. 2a, compare lanes 1 and 2). More 
TERT protein was co-precipitated with Flag-TPP1 when POT1, the 
in vivo binding partner of TPP1, was also present (compare lanes 2 and 
4). Addition of primer a5 (a ligand for POT1-TPP1 and a substrate for 
telomerase) to lysates containing Flag~-POT1 and Flag-TPP1 before 
immunoprecipitation increased the TPP1-telomerase interaction by 
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Figure 1 | Separation-of-function mutants of TPP1 affect telomerase 
processivity without affecting telomere complex formation. a, Domain 
architecture of POT1-TPP1 shown alongside the crystal structures of the 
DNA-binding domain of POT1 (cyan surface) bound to DNA (red ribbon; 
Protein Data Bank (PDB) accession 1XJV) and the TPP1(OB) domain (PDB 
accession 2146). b, DNA binding curves from filter-binding of mixtures 
containing trace amounts of **P-labelled ssDNA (GGTTAGGGTTAG), 

200 nM TPP1, and varying concentrations of POT1. See Supplementary Fig. 2a 
for dissociation constant (Kg) values. WT, wild type. c, Direct telomerase 
activity assay with primer a5 of lysates from cells co-transfected with a TR 
plasmid and Flag-tagged POT1, TPP1 and TERT plasmids. ‘No POT1-TPPI’ 


~2.5-fold compared to the basal interaction in the absence of POT 1 and 
a5 DNA (compare lanes 2 and 6). The enhanced interaction between 
telomerase and POT1-TPP1 is unlikely to be occurring indirectly via 
bridging of TPP1 and telomerase by POT1-DNA, because Flag—-POT1 
did not pull down telomerase even in the presence of a5 DNA (lane 5). 

The interaction between TPP1 and telomerase was reduced when 
either TR or TERT was omitted from co-transfections (Supplementary 
Fig. 7a, b). These results indicate that TPP1 interacts with telomerase 
optimally in the context of the fully assembled RNP. The TEN (telo- 
merase essential N-terminal) domain of TERT has been implicated ina 
functional interaction with POT1-TPP1” and in binding to TPP1(OB)”. 
However, deletion of the TEN domain or mutating G100V (ref. 19) 
did not abolish telomerase binding (Supplementary Fig. 7c and 
ref. 21). These data indicate that the observed immunoprecipitation 
of telomerase with TPP1 involves contacts both within and outside the 
TEN domain of TERT, which is consistent with the implication of 
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indicates transfection without POT1 and TPP1. The number of telomeric 
repeats added to primer are indicated along the left. d, Processive extension 
(>15 repeats/total) with TPP1 mutants relative to that with wild-type TPP1 
(green line) obtained from three independent sets of experiments (as in panel 
c); error bars indicate s.d. Stimulation of processivity is assessed relative to the 
‘no POT1-TPPI’ negative control (red line). Two-tailed student’s t-test with 
respect to R175V: *P < 0.02, **P < 0.005. Red labels indicate significantly 
defective; green labels indicate not significantly defective. e, Immunoblot of 
lysates used in panel c probed with anti-Flag antibody-HRP conjugate shows 
uniform expression. 


C-terminal TERT residues in TPP1(OB)-mediated recruitment of tel- 
omerase to telomeres”’. 

When the TPP1 mutants were tested for binding telomerase (Fig. 2b, 
c), there was strong correlation with the telomerase stimulation results. 
Five of the six mutants that showed a defect in telomerase stimulation 
in vitro also showed defects in telomerase association in the presence of 
POT1 and DNA primer, and the intermediate mutant E215A was also 
defective in binding. An exception was mutant L104A, which showed 
wild-type levels of associated telomerase. 

When the separation-of-function mutations that fail to stimulate 
processivity and also interfere with telomerase binding are mapped on 
the surface of the TPP1(OB) structure, the seven critical amino acids 
(E168, E169, E171, R180, L183, L212 and E215) cluster to reveal a 
patch on the protein surface (orange in Fig. 2d). We name this con- 
served surface the TEL patch (TPP1 glutamate (E) and leucine (L)-rich 
patch). The fact that the TEL patch promotes both processivity and 
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Figure 2 | TPP1 mutations that disrupt telomerase stimulation also disrupt 
telomerase binding. a, Pull down of transiently expressed Flag~TPP1, Flag— 
POT] and associated untagged TERT from HeLa-EM2-11ht lysates on anti- 
Flag conjugated beads. ‘Input’ indicates immunoblot of soluble cellular lysates 
before incubation with beads; ‘beads’ indicates immunoblot of proteins 
retained on antibody beads after 2 h incubation at 4 °C and washing. The TERT 
signal was plotted after correction for loading differences (using the actin 
signal). b, Comparison of TERT pull down by Flag-TPP1 mutants in the 
presence of Flag~POT1 and primer a5. Control 1, Flag~-POT1/TPP1 and 


telomerase binding fulfils a prediction of a previous model for telo- 
merase processivity'’. The ‘back face’ of TPP1(OB) (bottom view of 
Fig. 2d) contains amino acids not involved in binding telomerase. 

Previously, knockdown of TPP1 in human HTC75 cells resulted in 
telomere lengthening’, whereas skin keratinocytes from Tpp1“/“K5-cre 
newborn mice displayed telomere shortening’. However, loss of both 
end-protection and telomerase-regulatory functions of TPP1 compli- 
cates interpretation of such studies. To test how the TPP1-telomerase 
interaction influences telomere maintenance under conditions where 
chromosome end protection is unperturbed, we constructed stable 
cell lines containing a single-copy gene cassette with a bidirectional 
tetracycline-inducible promoter (Ptet-bi, Fig. 3a). The promoter 
expresses both a short hairpin RNA (shRNA; shTPP1) to knockdown 
endogenous TPP1 (Fig. 3b, c and Supplementary Fig. 8a-c) and a 
shRNA-resistant version of Flag-TPP1 (Flag-TPP1* or wild type*; 
Fig. 3d and Supplementary Fig. 8a, d). E169A/E171A*, R175V* and 
L212A* cell lines express shRNA-resistant versions of the various 
mutant TPP1 proteins (asterisks indicate shRNA-resistant mRNAs). 
We also constructed a similar set of cell lines in which various TPP1 
proteins were ectopically expressed without knocking down the endo- 
genous TPP1 (Supplementary Figs 9a, b and 10a, b). 

Overexpression of TPP1 as described above (~25-fold; data not 
shown) could potentially sequester shelterin components from telo- 
meres, leading to the appearance of telomere-dysfunction-induced foci 
(TIFs)'*. However, expression of the TPP1 mutants resulted in the 
same low levels of TIFs as wild type (Supplementary Fig. 11). These 
data are consistent with the unaltered POT1-DNA binding properties 
of the TPP1(OB) mutants (Fig. 1b) and with a previous study showing 
that deletion of the entire TPP1(OB) domain fails to elicit chro- 
mosome end-deprotection™. 

The stable cell lines expressing wild-type and control (R175V) Flag- 
TPP1 showed a robust increase in telomere length over 81 population 
doublings (~40 bp per population doubling) in a doxycycline-dependent 
(Flag-TPP1 expression-dependent) manner, whereas the telomeres of 
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primer a5 omitted; control 2, Flag-TPP1 (WT) present, Flag-POT1 and primer 
omitted; control 3, Flag~POT1 and primer present, Flag~TPP1 omitted. c, The 
mean TERT signal on beads obtained from quantification of three independent 
sets of experiments of which panel b is representative; error bars indicate s.d. 
Two-tailed Student’s t-test with respect to RI75V: *P < 0.05, **P < 0.01. The 
red label indicates significantly defective; green label indicates not significantly 
defective. d, Structure showing the TEL patch (amino acids in orange), the 
surface of the OB domain of TPP1 that mediates telomerase association and 
stimulation. Other amino acids mutated in this study are shown in cyan. 


the untransfected cell line were stable (Fig. 3e, f and Supplementary 
Figs 9c, d and 10c, d). In contrast, the telomeres of cell lines expressing 
E169A/E171A and L212A did not change in length over 81 population 
doublings. This result is in full agreement with our data showing that 
E169A/E171A and L212A have a greatly diminished ability to stimu- 
late telomerase (Fig. 1) and to associate with telomerase (Fig. 2), but 
that R175V behaves like the wild-type protein. We conclude that TPP1 
increases telomerase action in vivo and that the TEL patch of TPP1 is 
necessary for this effect. 

We tested whether the inability of the E169A/E171A and L212A 
mutants to stimulate telomere lengthening was due to their inability 
to recruit telomerase to telomeres. The cell lines stably co-expressing 
shTPP1 and TPP1* (wild type*, E169A/E171A*, R175V%*, or L212A*) 
were transfected with expression plasmids encoding TERT and TRand 
analysed by co-immunofluorescence (co-IF) and immunofluorescence- 
fluorescence in situ hybridization (IF-FISH). As expected, wild-type 
and all mutant Flag-TPP1 proteins localized to telomeres (Sup- 
plementary Fig. 12). Telomerase was efficiently recruited to telomeres 
containing wild-type and R175V proteins, as indicated by the co- 
localization of TR and Flag-TPP1 (Fig. 4a, b). By the same criterion, 
the E169A/E171A* and L212A* cell lines showed reduced levels of 
telomerase recruitment to telomeres. Instead, the TR foci in E169A/ 
E171A* and L212A%* cells resided in Cajal bodies (marked by coilin; 
Fig. 4c), indicating that the corresponding TPP1 mutations prevent 
telomerase in Cajal bodies from being delivered to telomeres. TEL 
patch mutations also reduced recruitment of endogenous TR to telo- 
meres (Supplementary Fig. 13). 

We conclude that a small patch of amino acids on the surface of the 
chromosome-end-binding protein TPP1, the TEL patch, binds telo- 
merase to enable both its recruitment to telomeres and its high- 
processivity extension of telomeric DNA. While this manuscript was 
under review, two groups independently reported the involvement of 
portions of the TEL patch in telomerase association*' and telomerase 
recruitment”. 
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Figure 3 | TPP1 TEL-patch mutants fail to stimulate telomere lengthening 
in human cells. a, Engineering HeLa-EM2-11ht stable cell lines containing 
single-copy integration of bidirectional Tet-inducible shTPP1 and shRNA- 
resistant Flag~TPP1* (wild type or mutants) genes. b, A stable cell line 
encoding shTPP1 (no exogenous Flag-TPP1) shows doxycycline-dependent 
knockdown of endogenous TPP1 protein. Nonspecific bands serve as loading 
controls. c, Quantitative RT-PCR showing knockdown of endogenous TPP1 
mRNA in the indicated cell lines + s.d. (n = 3). d, Western blot showing similar 
protein levels of shRNA-resistant Flag-TPP1 in the indicated cell lines also 
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Figure 4 | Failure to stimulate telomere lengthening correlates with 
inability to recruit telomerase to telomeres. a, Fluorescence in situ 
hybridization (FISH) detects TR (green) and immunofluorescence (IF) detects 
the indicated Flag-TPP1 proteins (red). In the merge panels, yellow spots 
indicate recruitment of telomerase to telomeres. b, Quantification of telomerase 
recruitment data of which panel a is representative. The average percentage of 
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expressing shTPP1. e, Telomeric restriction fragment (TRF) Southern blot of 
DNA from HeLa-EM2-11ht (untransfected HeLa) and stable cell lines 
expressing shTPP1 and the indicated TPP1 constructs at the indicated 
population doublings (PD). DNA length standards are indicated along the left. 
f, Change in mean telomere length for data shown in panel e was plotted against 
population doublings. The vertical bars at the right indicate the distinct ranges 
of telomere length attained by wild-type* and R175V* versus E169A/E171A* 
and L212A* cells after 81 population doublings. 
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Flag-TPP1 foci containing TR (y axis) and standard deviations (error bars) of 
15 fields of view (40-120 cells total) were plotted for the indicated stable cell 
lines. c, IF-FISH experiment showing that telomerase (TR in green) in E169A/ 
E171A* and L212A% cells fails to be recruited to telomeres (Flag-TPP1 in red) 
and instead remains localized in Cajal bodies (coilin in blue). 
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In the simplest model, the TEL patch binds telomerase directly. If 
there were a bridging molecule, it would probably be an abundant 
cellular component, because TPP1-telomerase binding is robust 
under conditions of TPP1 and telomerase overexpression, where any 
telomerase-specific factors would be substoichiometric. There is pre- 
cedence for such a bridging molecule in Schizosaccharomyces pombe, 
where Ccq1 physically connects Tpzl (the S. pombe TPP1) to telo- 
merase” **. However, there is no identifiable homologue of Ccq1 in 
humans (see Supplementary Discussion for comparison with telo- 
merase recruitment in budding yeast). 

The substantial reduction of telomerase recruitment by subtle muta- 
tions in the TEL patch of TPP 1 indicates the potential for novel strategies 
to inhibit telomerase for cancer therapy. By targeting the TEL patch, 
instead of telomerase itself, a compound could potentially inhibit 
telomerase action only at telomeres without interfering with other 
hTERT functions*”*. Furthermore, such an inhibitor should not per- 
turb genomic stability in normal cells lacking telomerase, because the 
TEL patch is physically and functionally separate from the portion of 
TPP1 engaged in chromosome end-protection. 


METHODS SUMMARY 


All experiments involving recombinant TPP1 expressed in bacteria’ used 
TPPI1(N) (amino acids 87-334), which recapitulates all in vitro functions of the 
full-length human protein’. Experiments involving TPP 1 expressed in human cells 
used full-length TPP1 that we define as starting from amino acid Met 87 (refs 9, 14) 
(Fig. 1a). Telomerase was obtained from HEK 293T cells transfected with human 
TERT and TR plasmids’* or from HeLa-EM2-1 ht cells transfected with Flag- 
TERT, TR, Flag-POT1 and Flag-TPP1 plasmids. Cell lines were engineered by 
integrating gene cassettes containing Tet-inducible TPP1 genes into a unique 
targeting site in the genome of the HeLa-EM2-11ht cell line (Tet Systems 
Holdings GmbH & Co. KG) using Flp recombinase-mediated cassette exchange. 
Where indicated, the integration cassette contained a bi-directional Tet-inducible 
promoter driving both shTPP1 and shRNA-resistant TPP1 genes in opposite 
directions. FISH and IF experiments were performed by modifying existing pro- 
tocols”’ (http://delangelab.rockefeller.edu/protocols.html). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


All experiments involving recombinant TPP1 expressed in bacteria’ used TPP1(N) 
(amino acids 87-334), which recapitulates all in vitro functions of the full-length 
human protein’. Experiments involving TPP1 expressed in human cells used full- 
length TPP1 that we define as starting from amino acid Met 87 (refs 9, 14) (Fig. 1a). 
Oligonucleotides. Synthetic oligonucleotides used in gel-shift, filter-binding and 
telomerase activity assays were purchased from Integrated DNA Technologies and 
resuspended in 10 mM Tris-Cl (pH 8.0) to obtain 50 uM stocks that were stored at 
—20°C and diluted to required concentrations immediately before use. 

Plasmid constructs for cell-based assays. Mammalian cell expression plasmids 
encoding human TERT (pTERT-cDNA6/Myc-His C'*) and human TR (phTR- 
Bluescript II SK(+)’*) were a gift from J. Lingner. The pdlgfpPtetmiR and pTet- 
BI4 vectors, and HeLa-EM2-11ht cell line were obtained from Tet Systems 
Holdings GmbH & Co. KG upon signing a materials transfer agreement. The 
pTet-BI4 vector, which contains a bidirectional tetracycline-inducible promoter 
(Ptet-bi), allows for doxycycline (dox)-dependent expression (Tet-on) of genes in 
the HeLa-EM2-11ht cell line, which constitutively expresses the reverse tetracy- 
cline-controlled transactivator gene**. The pIRES2-EGFP Rac2 plasmid” that was 
used to PCR-amplify IRES-GFP was a gift from G. Johnson. 

shRNA and shRNA-resistant constructs of TPP1. The target sequence for 
shRNAs illustrated in Supplementary Fig. 8 are as follows: shRNA-A, 5’-CGT 
TGCATCCGCTGGGTGT-3’; shRNA-B, 5'-TGGAGTTCAAGGAGTTTGT-3’; 
shRNA-C, 5'-GACTTAGATGTTCAGAAAA-3'". shTPP1, shown in Fig. 3, 
contains two repeats each of shRNA-B and shRNA-C arranged in tandem. shRNA- 
resistant mutations were introduced by site-directed mutagenesis using the 
following primers (and their reverse complements): shRNA-A resistant, 5’-GGA 
AACCCGGGCCCccTtCAcCCGCTcGGaGT gGCCGTGGGGATG-3’; shRNA-B 
resistant, 5'-CCCCAGAAACCTAGCCTcGAaTTtAAGGAaTTcGTcGGGTTG 
CCCTGCAAG-3'; shRNA-C resistant, 5’-GGGTGCCTGGTTGCAACCAAG 
AtcTtGATGTcCAaAAgAAGCTCTATGACTGCCTTGAGG-3’; where lower-case 
indicates silent mutations introduced in the TPP1 gene to make the mRNA res- 
istant to the indicated shRNA. In addition to possessing the duplex RNA sequence 
for silencing, all shRNA constructs also contained critical sequence elements 
derived from microRNA mR-30 for increasing the efficiency of RNA silencing”. 
Immunoblotting. Standard immunoblot protocols were used with the following 
antibodies at specified dilutions: mouse monoclonal anti-Flag M2-HRP conjugate 
(Sigma; A8592; 1:10,000), rabbit monoclonal TERT (C-term) antibody 
(Epitomics; 1531-1; 1:500), mouse monoclonal anti-f-actin antibody (Sigma; 
A5441; 1:10,000), and mouse monoclonal TPP1 antibody (Abnova; H00065057- 
M02; 1:1,000). Secondary horseradish peroxidase-conjugated goat antibodies 
against rabbit IgG (Santa Cruz Biotechnology; 1:10,000) or donkey antibodies 
against mouse IgG (Jackson Immunoresearch; 1:10,000) were used to reveal the 
primary antibodies using chemiluminescence detection by ECL plus reagents (GE 
Healthcare Lifesciences). The data were visualized and quantified using a 
FluorChem HD2 (Alpha Innotech) imaging system. 

Molecular cloning. C-terminally 3X -Flag-tagged human TPP1 and C-terminally 
3X-Flag-tagged human POT! genes were amplified from human TPP! cDNA 
(Open Biosystems; Thermo Scientific) and POT1 cDNA (pET-Smt3-POT1; 
unpublished results), respectively, using forward primers containing a Not! site 
upstream of the start codon and reverse primers that contained: a stop codon-to- 
serine mutation, a DNA sequence coding for a 3X -Flag tag, and a terminal Sall site 
at the 5’ end. For transient transfection experiments, the PCR fragments were 
restriction digested and cloned upstream of the IRES-GFP in the pTet-IRES- 
eGFP-BI4 vector (obtained by cloning a PCR-amplified and restriction digested 
IRES-eGFP fragment into pTet-BI4) to furnish p3X-Flag-TPP1-BI4 and p3x- 
Flag-POT1-BI4 plasmids. Using this construct, TPP1 and GFP proteins are syn- 
thesized as separate polypeptides in the cell. For stable transfection purposes, the 
TPP1-IRES-GFP fragment was subcloned from the p3X -Flag-TPP1-BI4 vector to 
the pdlgfpPtetmiR vector (replacing the existing d1GFP gene in the vector) that 
contains the F/F3 Flp recombination sites flanking the multiple cloning site to 
yield p3 X-Flag-TPP1-F3 constructs that were subsequently used for stable trans- 
fection. p3 X-Flag- TERT-cDNA6/Myc-HisC, which contains an N-terminal 3X<- 
Flag tag immediately upstream from the TERT start codon, was obtained via site- 
directed mutagenesis of pTERT-cDNA6/Myc-HisC". For stable co-transfections 
with shTPP1 and shTPP1-resistant TPP1, the bi-directionality of the pTet pro- 
moters of pTet-IRES-eGFP-BI4 and pd1gfpPtetmiR was exploited. Whereas the 
TPP1 constructs were cloned as mentioned above, the shTPP1 was cloned between 
the Spel/Nhel sites of the second multiple cloning site downstream of the second 
pTet element. The shTPP1-resistant mutations were made by site-directed muta- 
genesis (as detailed below). 

Site-directed mutagenesis. Missense mutations were introduced into the TPP1 
gene using fully complementary mutagenic primers (QuickChange site-directed 


mutagenesis kit from Agilent Technologies). For the generation of TPP1(N) 
mutants for overexpression in bacteria, the pET-Smt3-TPP1(N) plasmid’ was 
used as the template in the PCR, whereas for expression in the HeLa~-EM2-11ht 
cell line, p3X-Flag-TPP1-BI4 was used as the template in the PCR. The TPP1 
genes in the mutant plasmids were sequenced completely to verify the presence of 
the intended mutation(s) and exclude the acquisition of unwanted changes during 
PCR amplification and cloning. 

Protein purification. Purified mutant and wild-type Smt3-TPP1(N) fusion pro- 
teins were obtained from soluble lysates of isopropyl §-d-thiogalactopyranoside- 
induced BL21(DE3) cells after nickel-agarose chromatography, treatment with 
Ulp1 protease to cleave the Smt3 tag*’ and size-exclusion chromatography’. 
Purified recombinant human POT1 protein was obtained from baculovirus- 
infected insect cells as described previously’. 

Gel-shift and filter-binding assays. Electrophoretic mobility shift assays and 
filter-binding assays of POT1-TPP1-oligonucleotide complexes were performed 
exactly as described previously”. 

Cell culture. All human cells were cultured at 37 °C with 5% CO . HEK 293T cells 
were cultured in growth medium containing Dulbecco’s modified Eagle medium 
(DMEM) supplemented with 10% fetal bovine serum (FBS), 2 mM glutamax (Life 
Technologies), 100 units ml”! penicillin, and 100 pg ml~’ streptomycin. HeLa- 
EM2-11ht and stable cell lines derived from it were propagated in growth medium 
containing DMEM, 10% FBS, 2mM glutamax, 1 mM sodium pyruvate, 100 units 
ml * penicillin and 100 1g ml! streptomycin. Doxycycline was added to a final 
concentration of 200 ng ml’ for induction of pTet-driven genes. 

Telomerase preparations. Telomerase preparations from HEK 293T cell extracts 
(super-telomerase) were obtained from transient transfection of TERT and TR 
using a published protocol’*. For telomerase preparations from HeLa-EM2-11ht 
cells, 300,000 cells per well of a 6-well plate were seeded and transfected 42 h later 
using Lipofectamine 2000 (Invitrogen) and indicated plasmid DNA using the 
manufacturer’s recommendation. 1 jig p3-Flag- TERT-cDNA6/Myc-HisC and 
3 tg phTR-Bluescript II SK(+)'* were added per transfection. For transfections 
involving POT1 or TPP1, 1 pg of p3x-Flag-POT1-BI4 or p3X-Flag-TPP1-BI4 
was added per transfection. In control transfections where POT1 and TPP1 were 
omitted, pTet-B14** (empty vector) was included. Medium was removed after 5h 
and exchanged with fresh medium containing 200 ng ml ' doxycycline to induce 
TPP1 and/or POT] expression from the tetracycline-driven promoter. After 48h 
of transfection, the cells were trypsinized, washed with phosphate buffered saline 
(PBS), re-suspended in 100 tl CHAPS lysis buffer'’ containing 1 jl of RNasin plus 
(Promega), mixed with rocking on a nutator at 4°C for 20 min, and centrifuged 
(13,600 r.p.m., 10 min) to remove cell debris. Aliquots of the soluble cell lysates 
were flash frozen in liquid nitrogen and stored at —80 °C. 

Telomerase activity assays. Telomerase reactions were carried out as 20 tl reactions 
containing: 50 mM Tris-Cl (pH 8.0), 30 mM KCl, 1mM MgCl, 1 mM spermidine, 
5 mM f-mercaptoethanol, the indicated concentration of primer a5 or b’, 500 uM 
dATP, 500M dTTP, 2.92 1M unlabelled dGTP, 0.33 uM radiolabelled dGTP 
(3,000 Cimmol '), and 3 ul of super-telomerase cell extract (or telomerase from 
HeLa-EM2-11ht cell extracts) at 30°C for 30 min. Reactions were stopped with 
buffer containing 100 ul of 3.6 M ammonium acetate, 20 ug of glycogen and a 5’ 
end-labeled (7.5 c.p.m. pl’) 18mer oligonucleotide loading control, and precipi- 
tated with ethanol. The pellets were re-suspended in 10 pl H2O and 10 pl ofloading 
buffer (95% formamide, 5% H,O, loading dye), heated at 95°C for 10 min, and 
loaded onto a 10% acrylamide, 7 M urea, 1X TBE sequencing gel. Gels were run at 
90 W for 1.75 h, dried, exposed to an image plate and imaged on a phosphorimager 
(TyphoonTrio). The data were analysed using Imagequant TL software and telo- 
merase activities quantified as described previously’’. Processivity calculations 
were performed as described previously’ (Supplementary Fig. 4) although it 
was not possible to include the higher molecular weight products on the gel in 
these calculations due to severe band overlap. To include the contributions of these 
upper bands (which are the result of the highly processive action of telomerase; 
data not shown) in processivity measurements, we defined a ‘15+’ relative pro- 
cessivity term as the fraction of the total activity present in bands resulting from 15 
(number chosen arbitrarily to define the lower limit of high processivity) or more 
telomeric repeats added by telomerase (Fig. 1d and Supplementary Fig. 3c), noting 
that the ‘15+’ method does not correct for the greater number of G nucleotides 
(three per repeat) in the upper bands compared to lower bands. 
Co-immunoprecipitation experiments. The protocol was adapted from a prev- 
iously published protocol’. HeLa-EM2-11ht cells were seeded and transfected as 
stated above for telomerase preparations. After 24h of transfection, cells were 
washed with PBS, dislodged with a cell-scraper using 400 il ice-cold lysis buffer 
(50 mM Tris-Cl (pH 7.4), 20% glycerol, 1 mM EDTA, 150 mM NaCl, 0.5% Triton 
X-100, 0.02% SDS, 1mM dithiothreitol, 2mM phenylmethylsulphony] fluoride, 
complete protease inhibitor cocktail (Roche)) and kept on ice. After 5 min, 20 pl of 
5M NaCl was added and mixed. After another 5 min on ice, 420 ul of ice-cold 
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water was added and mixed before immediate centrifugation (13,600 r.p.m., 
10 min). Supernatants were collected and used directly for immunoprecipitation. 
Lysate (40 pl) added to 40 pil of 4X LDS sample loading buffer (Invitrogen) was 
kept aside for analysis of ‘input’ samples. Anti-Flag/M2 affinity gel beads (50 1; 
Sigma) pre-incubated with 100 mg ml * of bovine serum albumin in PBS were 
added to lysate prepared from each well of a 6-well plate. Mixtures were mixed 
with rocking on a nutator for 2 h at 4°C. Beads were washed three times with 1:1 
diluted lysis buffer and proteins were eluted with 180 il 1:1 diluted 4x LDS sample 
loading buffer (Invitrogen) for analysis of ‘beads’ samples. All ‘input’ (10 pl) and 
‘beads’ (15 jl) samples were heated at 90 °C for 12-14 min and analysed by SDS- 
PAGE and immunoblotting. 

Stable cell line generation using HeLa-EM2-1 Lht cells and p3 x -Flag-TPP1-F3 
plasmids. Detailed protocols for stable transfection of HeLa-EM2-1 ht cells are 
reported elsewhere*’. HeLa-EM2-1 tht cells were co-transfected in a 6-well format 
using either Lipofectamine 2000 (Life Technologies) or TransIT (Mirus) using the 
manufacturer’s protocol with 1 jg each of the p3X-Flag-TPP1-F3 (containing 
wild type, E169A/E171A, R175V, or L212A mutants for experiments in 
Supplementary Fig. 9, and containing shTPP1 and shTPP1-resistant wild type, 
E169A/E171A, R175V, or L212A mutants for experiments in Fig. 3), and 1 pg ofa 
plasmid expressing Flp recombinase and conferring puromycin resistance. Twelve 
hours post-transfection, transfected cells were selected for 36h using 5ug ml * 
puromycin. Subsequently, fresh medium lacking puromycin but including 
50-100 1M ganciclovir (Sigma-Aldrich) was added and negative selection was 
conducted for 10 days. Next, 12 individual clones were picked from each trans- 
fection and expanded. To verify the identity of the clone and distinguish it from 
false-positive clones that survived selection, a small aliquot of cells was induced 
overnight with 200 ng ml ' doxycycline and observed under a fluorescence micro- 
scope (Supplementary Fig. 10a). Positive clones were selected based on green 
fluorescence arising from their IRES-GFP locus downstream of the Flag-TPP1 
gene (Fig. 3a) and also confirmed using flow cytometry (Supplementary Fig. 
10b). The typical efficiency of cloning (positive clones/total survivors) using this 
protocol was between 8% and 50%. Positive clones were expanded until they grew 
to confluency in 10-cm culture dishes and induced with 200 ng ml’ doxycycline 
for studying the effect of TPP1 mutations in cells. 

Telomere length analysis. Genomic DNA was isolated from confluent 10-cm 
culture dishes of the original HeLa-EM2-11ht cells and Flag-TPP1 expressing 
stable cell lines using the GenElute kit (Sigma). Genomic DNA (1.5 11g) was 
restriction digested with frequent cutters Hinfl and Rsal overnight at 37 °C. 
The DNA digests were run on a 0.8% Agarose-1X TBE gel at 50 V for a total of 
1,100 V-hours. A 5’ **P-labelled (with T4 PNK; NEB) 2 DNA-HindIII digest 
ladder (10,000 c.p.m.) was run as a marker on a separate lane on the gel. Next 
morning, the gel was shaken in 0.25M HCl for 15 min followed by two rounds 
(15 min each) of shaking in solution containing 0.5 M NaOH and 1.5M NaCl. 
Next, the gel was shaken in solution containing 0.5 M Tris-Cl and 1.5M NaCl 
(pH7.5) for 30 min. The gel was blotted on the positively charged Hybond N+ 
(GE) membrane in 10 SSC buffer (1.5 M NaCl, 0.15 M sodium citrate at pH 7.2) 
overnight by capillary blotting. Next morning, the membrane was irradiated with 
UV (260 nm; set at 1,200 LJ X 100 energy) and prehybridized in 15 ml Rapid-hyb 
(GE) for 30 min at 50°C in a hybridization oven with rotation. 5’ °2D_labelled 
(with T4 PNK; NEB) telomeric probe of sequence (TTAGGG), was added 
(20,000,000 c.p.m.) and hybridization was continued for 1 h at 50 °C. The hybridi- 
zation solution was discarded and the membrane was rinsed three times with 
buffer containing 0.1 SSC and 0.1% SDS. The membrane was washed three 
times in this buffer at 50 °C, wrapped in Saran-wrap, exposed to a phosphorimager 
screen for 24-72 h, and analysed using the Imagequant TL software. The gel was 
calibrated using the known molecular weights of the 4 DNA-HindIII ladder and 
the mean telomere length for each lane was plotted as a function of population 
doubling for each cell line. A linear regression (MS Excel) was used to calculate the 
rate of telomere elongation. 

Quantitative RT-PCR. Total RNA was purified from the indicated cell lines using 
TRIzol reagent (Life Technologies). Total cDNA was prepared from total RNA 
using the High Capacity cDNA reverse transcription kit (Applied Biosystems). A 
fragment of the cDNA from endogenous TPP1 mRNA spanning a part of the 5’ 
UTR (absent in Flag-tagged constructs) and continuing into the ORF was amplified, 
and qPCR was performed with the iQ SYBR green Supermix (Biorad) using the 
LightCycler 480 (Roche) equipment. 

Immunofluorescence (IF) and fluorescence in situ hybridization (FISH). For 
IF and FISH experiments, protocols were adapted from those described previously 
in the literature’ and online sources (http://delangelab.rockefeller.edu/protocols. 
html). For TIF analysis using co-immunofluorescence (co-IF), 10,000 cells of 
HeLa-EM2-11ht-derived stable cell lines were seeded on coverslips in a 12-well 
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culture plate containing growth medium adjusted to 200ngml~' doxycycline. 
After 96h, medium was removed and all subsequent steps were performed at 
room temperature. Cells were washed once with PBS and fixed with 3% or 4% 
formaldehyde in PBS for 8 min. The fixative was removed and the cells washed 
three times with PBS. The cells were permeabilized with PBS containing 0.1% 
Triton X-100 (PBS-T) for 5 min and blocked in PBS-T containing nuclease-free 
3% BSA for 30min. Cells were incubated with mouse monoclonal anti-TRF2 
(Imgenex; IMG-124<; 1:500) and rabbit polyclonal anti-53BP1 (Novus Biologicals; 
NB100-304; 1:1,000 dilution) in PBS-T containing nuclease-free 3% BSA for 1h. 
The cells were then washed three times in PBS (5 min each), and incubated with 
Alexa Fluor 568-conjugated anti-mouse IgG (Life Technologies) and Alexa Fluor 
647-conjugated anti-rabbit IgG (Life Technologies) diluted 1:500 in PBS-T con- 
taining nuclease-free 3% BSA for 30 min in the dark. The cells were then washed 
three times in PBS and the excess PBS was removed by blotting. The coverslips 
were mounted on microscope slides using Vectashield mounting medium with 
DAPI (Vector Laboratories), sealed using transparent nail polish and stored at 4 °C 
or —20°C until the time of imaging. Detection of Flag-TPP1 and RAP1 by co-IF 
was done essentially as above, but with mouse monoclonal anti-Flag M2 (Sigma; 
F1804; 1:500) and rabbit polyclonal anti-RAP1 (Novus Biologicals; NB100-292; 
1:500) primary antibodies. 

For combined immunofluorescence-fluorescence in situ hybridization (IF-FISH) 

experiments, the IF was performed before FISH. Briefly, IF was performed as 
described above using the appropriate primary and secondary antibodies. After 
the final PBS wash, the cells were fixed again in 4% formaldehyde in PBS for 10 min 
at room temperature. The cells were washed twice with PBS and dehydrated by 
successive 5-min incubations in 70%, 95% and 100% ethanol. The ethanol was 
removed and the coverslips allowed to dry for 2 min. The cells were rehydrated in 
50% formamide in 2X SSC for 5 min. The coverslips were placed (with cells facing 
down) on a drop (~40 ull) of pre-hybridization solution containing 100 mg ml! 
dextran sulphate, 0.125 mg ml! E. coli tRNA, 1 mg ml! nuclease-free BSA, 
0.5 mg ml | salmon sperm DNA, 1 mM vanadyl ribonucleoside complexes, and 
50% formamide in 2 SSC for 1 hat 37 °C ina humidified chamber. Hybridization 
solution was made by adding a mixture of three Cy5-conjugated TR probes” 
(30 ng of each probe per coverslip) to the pre-hybridization solution and the cells 
were hybridized in this solution overnight at 37 °C in a dark humidified chamber. 
Next morning, the cells were washed twice in 50% formamide in 2X SSC and twice 
in PBS, and the coverslips were then mounted on microscope slides using ProLong 
Gold Antifade Reagent (Life Technologies). After 24h at room temperature in the 
dark, the coverslips were sealed using transparent nail polish and stored at 4 °C or 
—20°C until the time of imaging. In addition to antibodies mentioned above, 
rabbit polyclonal anti-coilin (Santa Cruz; sc-32860; 1:100) and Alexa Fluor 405- 
conjugated anti-mouse (and anti-rabbit) IgG antibodies (Life Technologies; 1:500) 
were used to stain and image Flag~TPP1 and Cajal bodies. 
Microscopy. Imaging was performed using a Nikon TE2000-U inverted fluo- 
rescence microscope equipped with Photometrics Cascade II EM-CCD camera 
and Yokagowa Spinning disc Confocal (CSU-Xm2) (Nikon Instruments, Inc.) 
and a 60% oil objective. Images were acquired using Metamorph software. Linear 
image adjustments were made when necessary using Image] and Adobe Photoshop 
and applied to all images in a given channel. The colours depicted in the figures do 
not necessarily correspond to the emission of the fluorescent labels, but are used to 
facilitate visualization/analysis of ‘merged’ signals from two or more channels. 
Representative cells are shown in all microscopy figure panels. Quantification of 
co-localizations was done manually and plots of numbers of co-localizations per cell 
(one focal plane) show data obtained from 15 or more fields of cells (40-120 cells in 
total) for each sample set processed in parallel on the same day. 


28. Weidenfeld, |. et al. Inducible expression of coding and inhibitory RNAs from 
retargetable genomic loci. Nucleic Acids Res. 37, e50 (2009). 

29. Abell, A.N. et al. Rac2°’N, a dominant inhibitory Rac2 mutant that inhibits p38 
kinase signaling and prevents surface ruffling in bone-marrow-derived 
macrophages. J. Cel! Sci. 117, 243-255 (2004). 

30. Berger, S. M. et a/. Quantitative analysis of conditional gene inactivation using 
rationally designed, tetracycline-controlled miRNAs. Nucleic Acids Res. 38, e168 
(2010). 

31. Mossessova, E. & Lima, C. D. Ulp1-SUMO crystal structure and genetic analysis 
reveal conserved interactions and a regulatory element essential for cell growth 
in yeast. Mol. Cell 5, 865-876 (2000). 

32. Nandakumar, J., Podell, E.R. & Cech, T. R. How telomeric protein POT1 avoids 
RNA to achieve specificity for single-stranded DNA. Proc. Natl Acad. Sci. USA 107, 
651-656 (2010). 

33. Weidenfeld, |. Inducible microRNA-mediated knockdown of the endogenous 
human lamin A/C gene. Methods Mol. Biol. 815, 289-305 (2012). 


©2012 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


ADDENDUM 
doi:10.1038/naturel1735 


Addendum: The Cancer Cell Line 
Encyclopedia enables predictive 
modelling of anticancer drug 
sensitivity 

Jordi Barretina, Giordano Caponigro, Nicolas Stransky, 
Kavitha Venkatesan, Adam A. Margolin, Sungjoon Kim, 
Christopher J.Wilson, Joseph Lehar, Gregory V. Kryukov, 
Dmitriy Sonkin, Anupama Reddy, Manway Liu, 

Lauren Murray, Michael F. Berger, John E. Monahan, 
Paula Morais, Jodi Meltzer, Adam Korejwa, 

Judit Jane-Valbuena, Felipa A. Mapa, Joseph Thibault, 
Eva Bric-Furlong, Pichai Raman, Aaron Shipway, 

Ingo H. Engels, Jill Cheng, Guoying K. Yu, Jianjun Yu, 
Peter Aspesi Jr, Melanie de Silva, Kalpana Jagtap, 

Michael D. Jones, Li Wang, Charles Hatton, 

Emanuele Palescandolo, Supriya Gupta, Scott Mahan, 
Carrie Sougnez, Robert C. Onofrio, Ted Liefeld, 

Laura MacConaill, Wendy Winckler, Michael Reich, Nanxin Li, 
Jill P. Mesirov, Stacey B. Gabriel, Gad Getz, Kristin Ardlie, 
Vivien Chan, Vic E. Myer, Barbara L. Weber, Jeff Porter, 
Markus Warmuth, Peter Finan, Jennifer L. Harris, 
Matthew Meyerson, Todd R. Golub, Michael P. Morrissey, 
William R. Sellers, Robert Schlegel & Levi A. Garraway 


Nature 483, 603-607 (2012); doi:10.1038/nature11003 


In the Supplementary Information of this Letter, the use of distinct data 
normalization and directionality methods for pharmacological response 
calculations caused minor inconsistencies. We have therefore updated 
Supplementary Table 11 and some of the Supplementary Figures to 
resolve any confusion (see the Supplementary Information to this 
Addendum). We also wish to describe the relevant drug sensitivity 
normalization and response score calculations more completely. 

Two versions of the drug response data were generated. First, raw 
activity values were calculated at each dose as A = 100(T/U — 1), in 
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which T represents the Cell Titer Glo (CTG) level measured for the 
compound-treated well, and U is the median level of the untreated 
wells across the plate. This raw A is 0% with no drug and 100% for fully 
active compounds, when no CTG is detected. Second, the data were 
adjusted to a plate surface pattern and normalized to the MG132 posi- 
tive control, as described in the Supplementary Methods. This norma- 
lized A is also 0% with no drug, but 100% corresponds to the median 
MG132 response on that plate. Although normalized drug responses 
were used to determine ECs 9, IC59 and Aya, values, we used the raw 
drug responses for calculating the activity area (AA). This distinction is 
now clear in the corrected Supplementary Table 11 (the two AA mea- 
sures, derived from raw or normalized data, correlate closely: r = 0.98). 

The activity is the sum of differences between the measured A; 
at concentration i and A = 0, excluding positive A values: AA= 
>* i{O— min (0,A;/100)}. This AA has a value of 0 with no drug, 
and +8 for a compound inhibiting at A = 100% at all eight drug 
concentrations, as illustrated in Fig. 2b of the original Letter. We hope 
that this definition eliminates any confusion that may have existed in 
the original Supplementary Methods (page 13) and enables others to 
reproduce our AA results starting from the raw drug sensitivity data. 
As a further means of clarification, we have added three columns to 
Supplementary Table 11 showing the raw (non-normalized) response 
data necessary to calculate AA, MG132 activity, and AA derived from 
normalized response data. 

In addition, although all computational analyses used the above AA 
formula, a few Supplementary Figures (Supplementary Figs 6, 11,9 and 
14b) used a scale showing 8 — AA. This value was used for display 
purposes, so that low values corresponded to sensitive cell lines and the 
visualization remained consistent with other sensitivity metrics (ICs, 
Amax). This specification was noted in Supplementary Fig. 8 but had 
been inadvertently cut off the Supplementary Fig. 9 legend. We have 
therefore updated the Supplementary Fig. 9 legend to clarify where an 
inverted scale was used, and updated the scale of Supplementary Figs 6, 
11 and 14b to reflect our definition of AA (noted above). 

These changes do not affect the analyses, results or scientific con- 
clusions presented in the paper. The authors are indebted to B. Yadav, 
who alerted them to these inconsistencies. 


Supplementary Information is available in the online version of the 
Addendum. 
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CARL ZEISS MICROSCOPY 


TWO MICROSCOPES ARE 
BETTER THAN ONE 


Using two different kinds of imaging can give scientists a powerful combination 
of high specificity and detailed structural information. 


This image of a butterfly wing was constructed using two types of microscopy: a confocal image of the reflective eyespots shows scales (green) and wing (red); 


scanning electron microscopy reveals the different structure of non-reflective scales (upper left). 


BY CAITLIN SMITH 


here are many types of microscopy, 
each providing a unique set of benefits. 
Light microscopy of cells that express 
fluorescently labelled molecules, for example, 
lets scientists observe the movements of spe- 
cific molecules or protein complexes in live 
cells in real time, or in fixed samples. Scan- 
ning electron microscopy (EM) reveals tiny 
details of the cell surface, and transmission 


EM shows the detailed cytoarchitecture of sec- 
tions through fixed tissue. Other aspects of the 
structure of cells and tissues can be explored 
using techniques such as ion microscopy, total 
internal reflection microscopy, atomic force 
microscopy and super-resolution microscopy. 
Each type provides different information, but 
using two sorts of microscopy simultane- 
ously provides even more — and molecu- 
lar tools have been developed to link them 
together. Researchers testing the waters of this 


= 


‘correlative microscopy are beginning to dis- 
cover its challenges and rewards. 

Correlated light-electron microscopy, for 
example, provides both the specificity and 
real-time observation of light microscopy 
with fluorescent labelling and the better struc- 
tural resolution of EM. But such correlative 
microscopy has historically been tricky to use. 
Scientists who tried it might have needed to 
switch to EM halfway through an experiment, 
which meant moving the sample from one 
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Composite image created from correlated light-ion microscopy showing a fibroblast stained to show 
tubulin (red), actin (green) and the nucleus (blue). 


microscope to another that may be located 
in a different lab, or even in another build- 
ing. Not only would this journey take time, 
but delicate samples could be damaged by 
the motion, or by changes in temperature or 
humidity. 

A second obstacle is the need to ‘find back, 
which means searching the thin section pre- 
pared for EM to relocate the region observed 
under the light microscope. To help them find 
the sample, scientists could create fiducial 
markers, which act as landmarks to pinpoint 
the area of interest — they could be marks 
etched onto the sample dish, for example. 
But even then, the process is not straightfor- 
ward, and correlative 
microscopy has faced 
a slow uptake. 

Instrument manu- 
facturers have devel- 
oped several solutions 
to address these prob- 
lems. In the summer 
of 2010, Carl Zeiss, 
based in Jena, Ger- 
many, released the Shuttle & Find. This consists 
of a sample holder, enabling tissue sections to 
be transported safely, and a software module 
that connects to both microscopes and uses a 
coordinate system to find the region of inter- 
est. According to Kirk Czymmek, director of 
North American labs at Carl Zeiss Microscopy 
in Thornwood, New York, this approach offers 
a way of “relocating the region of interest in 


different microscope systems within a matter 
of minutes — a task that until now had taken 
hours, and sometimes even days”. 

Another way to make finding back easier 
is to use thicker tissue sections in EM. Cell 
biologist Judith Klumperman, director of the 
Cell Microscopy Center at the Utrecht Uni- 
versity Medical Center in the Netherlands, 
says that she uses correlative light-electron 
microscopy to characterize “distinct endo- 
some populations by their dynamics, interac- 
tions, subcellular localization, cargo, protein 
composition and ultrastructural morphol- 
ogy”. After observing the dynamics of fluo- 
rescently labelled lysosomal proteins with a 
light microscope, for example, her group uses 
EM to study their structural details. “When 
using thin 80-100-nanometre sections, these 
structures will appear in only one or two EM 
sections of the 20 to 40 that are generally 
obtained from one cell,” says Klumperman. 
“With three-dimensional electron tomogra- 
phy, we can use 300-400-nanometre sections, 
which increases the chance of finding back the 
region of interest” 

There is no need to switch between micro- 
scopes if one instrument integrates both 
fluorescence and electron microscopy. Fur- 
thermore, using the same stage reduces 
the chance of damaging the specimen and 
makes it easier to find back the region of 
interest. Zeiss’s Merlin, for example, has all 
these features and also offers the option of 
an integrated atomic force microscope (see 
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“Multimodal microscopy’) in the vacuum 
chamber of its scanning EM. A range of simi- 
lar correlative microscopes is produced by FEI 
based in Hillsboro, Oregon. 

The ClairScope, produced by Japanese 
company JEOL, based in Tokyo, has a differ- 
ent solution to the same problems. It has an 
inverted scanning EM below the culture dish, 
which can be viewed at atmospheric pressure, 
and an optical microscope above it. A win- 
dow coated with a silicon nitride film allows 
electrons to be projected from underneath 
while maintaining a vacuum between the 
EM and the sample dish. This set-up allows 
researchers to perform concurrent imaging of 
a sample in solution by both microscopes, says 
Donna Guarrera, assistant director in the SEM 
Division at JEOL USA. “There is no running 
from the optical microscope lab, then prepar- 
ing the sample to be vacuum-compatible for 
SEM imaging.” 

Despite recent improvements, techni- 
cal difficulties still limit what scientists can 
accomplish with correlated light and electron 
microscopy. Jeffrey Caplan, associate direc- 
tor of the University of Delaware's bioimaging 
centre, finds that the speed of fixation is hold- 
ing him back — he would like to study live-cell 
dynamics and then immobilize the cells rap- 
idly for EM imaging. “Current cryo-fixation 
methods take about 10 seconds to a couple of 
minutes,” he says, “but we would like a tool that 
can stop movement of dynamic structures in 
less than one second if possible?” This would 
make it easier to study dynamic events such 
as vesicle docking and trafficking, cytoskeletal 
remodelling and calcium signalling’. 

Immunogold labelling has historically been 
the gold standard for identifying subcellular 
structures in EM. However, the sheer bulk of 
gold-labelled antibodies can interfere with the 
identification of targets with single-molecule 
precision. Caplan’s group is now developing 


Correlated epifluorescence (purple) and 
atomic force microscopy (brown) images of live 
osteocarcinoma cells stained for actin. 
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Multimodal microscopy 


Correlative microscopy is becoming 
more widespread as fresh opportunities 
to correlate different types of imaging 
data arise. “Rather than just correlative 
imaging, the goal is to combine a whole 
suite of characterization methods into 
something that might better be described 
as multimodal microscopy,’ says Andrew 
Peele, director of science at the Australian 
Synchrotron in Clayton, a suburb of 
Melbourne, Australia. The synchrotron is a 
circular particle accelerator, about the size 
of a football field, where strong magnetic 
fields force high-energy electrons to travel 
round its tunnels. The intense beams of light 
it generates, usually optimized in the X-ray 
region, can be used for imaging experiments 
that may include X-ray fluorescence, X-ray 
diffraction, light microscopy, electron 
microscopy, tomography or infrared 
microscopy, says Peele. 

Despite all these options, multimodal 
imaging is not straightforward. “One 


an alternative to immunogold labelling for 
electron and super-resolution microscopy 
“to ensure that single-molecule localizations 
are accurately mapped onto the EM image’, 
he says. 


LIGHT-ION MICROSCOPY 

One drawback of correlative light-electron 
microscopy is the fixed order of tasks: fluores- 
cence microscopy must be carried out before 
EM, rather than after it, because the electron 
beam can destroy the fluorescence signal. It is 
therefore not possible to verify fluorescence 
microscopy results after EM. Scanning ion 


The Australian Synchrotron’s beamline allows fine measurements in multimodal microscopy. 


of the challenges we face is comparing 
information across widely different 
techniques,” says Peele, “especially those 
with quite different resolutions”, such as 
fluorescence data and high-resolution 
diffraction data. 

Other labs are using multimodal imaging, 
too. Molly Stevens, a biomedical materials 
professor at Imperial College London, 
hopes her research using correlative 
microscopy will advance our understanding 
of cells and their environments, perhaps 
leading to applications such as the 
design of biomaterials and scaffolds for 
tissue engineering. Her group is currently 
developing a protocol for correlating 
images from electron and ion microscopy, 
and following it with fluorescence 
imaging. “Then we will be able to correlate 
topographical information, ultrastructures 
of cells and specific biochemical changes 
using fluorescently tagged proteins,” 
she says. 0.5. 


microscopy may provide the answer. 
Biomedical materials professor Molly 
Stevens and her colleagues at Imperial College 
London recently described correlated light- 
ion microscopy (CLIM), in which fluorescence 
microscopy is correlated with ion microscopy’. 
Ion microscopy is similar to EM except that 
a beam of ions is used to scan the sample, 
instead of electrons. After imaging the fluo- 
rescent signal, Stevens and her colleagues fixed 
the sample and performed ion microscopy 
by scanning it with a beam of gallium ions. 
“T believe that helium ions might also work,’ 
says Sergio Bertazzo, a postdoctoral researcher 
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in Stevens’ lab. “One concern we have is that 
helium ions might penetrate much more into 
the sample, damaging the fluorescence signal.” 
Because the gallium ions did not damage it, the 
scientists could go back and forth between the 
two microscopy techniques. They could revisit 
the fluorescence in the sample after assessing 
its three-dimensional structure with scanning 
ion microscopy — several times for the same 
sample, in fact. 

“We have used this to study how cells inter- 
act with environments such as biomateri- 
als, scaffolds for tissue engineering, and 2D 
micropatterned surfaces’, which guide cell 
growth in specified geometries, Stevens says. 

Her group has also combined ion micros- 
copy with total internal reflection fluores- 
cence microscopy, a technique that can boost 
the resolving power to single-molecule reso- 
lution near the edges of the cell. The result- 
ing sub-micrometre scale resolution allowed 
Stevens’ group to see a migrating fibroblast 
in an intermediate step of the migration pro- 
cess, with one edge mobilized for travel and 
the opposite edge still firmly adhering to the 
substrate’. 


FLUORESCENCE-ATOMIC FORCE MICROSCOPY 
In another approach called atomic force 
microscopy (AFM), the sample is scanned by 
the tip of a sensitive cantilever probe, provid- 
ing high-resolution, three-dimensional, struc- 
tural information. Unlike electron microscopy, 
AFM can deliver three-dimensional images of 
live cells without requiring imaging agents 
such as fluorophores, and it can detect single 
molecules with nanometre resolution. 

There is a practical reason why microscopy 
companies sometimes correlate AFM and light 
microscopy. “AFM can only visualize a very 
small area at a time, so you are essentially hunt- 
ing around blind without correlation with light 
microscopy,’ says Ben Ohler, product man- 
ager for research atomic force microscopes 
at Bruker, based in Billerica, Massachusetts. 
“Some basic correlation is simply a require- 
ment to participate in the market.” 

Nicholas Geisse, a bio-applications sci- 
entist at Asylum Research in Santa Barbara, 
California, which specializes in AFM, thinks 
that AFM needs to have faster imaging to 
keep up with millisecond-scale fluorescence 
microscopy measurements. Asylum’s Cypher 
AFM was designed for faster scanning and 
data acquisition, he says. “Many of Cypher’s 
technical advancements, including the use of 
small cantilevers, have enabled high-speed 
scanning.” 

Bruker’s latest instruments are also designed 
with speed in mind, says Ohler. “Where AFM 
images have typically taken minutes per image, 
Dimension FastScan Bio now acquires images 
in seconds, or even several images per second,” 
he says. He believes that such technology can 
lead to AFM measurements of dynamic bio- 
logical events. 
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A research group in Italy is also heading 
in that direction. Physicist Alberto Diaspro, 
director of the nanophysics department at the 
Italian Institute of Technology in Genoa, is 
correlating AFM with super-resolution stimu- 
lated emission depletion (STED) microscopy 
to develop tools for topographical imaging, 
nanomechanical imaging, and measure- 
ments of cell stiffness*. In STED, researchers 
intentionally deactivate some fluorophores 
in part of the sample, which enhances the 
resolution in that area. Diaspro is optimistic 
that combining correlative-microscopy tech- 
niques with STED’s nanoscale resolution will 
reveal valuable information about cellular 
nanostructures. The rapid image acquisition 
possible with STED also holds promise for 
measuring fast events, such as intracellular 
vesicle dynamics during secretion or neuro- 
transmitter release, or cytoskeletal remodel- 
ling during cell motility. 


FLUORESCENT BRIDGES 

Researchers have long searched for a mole- 
cule that can bridge fluorescence and electron 
microscopy — a fluorescent molecule that also 
stains in EM images, for example. The ideal 
tool would be a genetically encodable molecu- 
lar tag, small enough to penetrate fixed tissues 
better than awkward gold-labelled antibodies, 
but fluorescent and capable of delivering good 
contrast in EM. Two groups at the National 
Center for Microscopy and Imaging Research 
at the University of California, San Diego, are 
investigating a variety of molecules for use 
in more than one type of microscopy. Mark 
Ellisman, the centre’s director, and biochem- 
ist Roger Tsien, who shared the 2008 Nobel 
Prize in Chemistry for developing the widely 
used green fluorescent protein (GFP) tag, often 
collaborate in what Ellisman calls “molecular 
painting” in the search for a reliable, geneti- 
cally encodable tag to label proteins in EM. 
“We were looking for the GFP of electron 
microscopy,’ says Ellisman. 

Last year, researchers from both groups 
revealed a molecular tool they had engineered 
called mini singlet oxygen generator (mini- 
SOG), which is derived from a plant photo- 
receptor’, Singlet oxygen generator molecules 
are easy to see because they fluoresce in light 
microscopy and can be stained by diaminoben- 
zidine for EM*. The group expressed proteins 
labelled with miniSOG to demonstrate its util- 
ity as a genetically encoded tag for protein tar- 
gets in EM. “MiniSOG is an extremely good 
singlet oxygen generator, so we believe this 
one will get to single-molecule sensitivity,’ says 
Ellisman. “We're working on that now.’ 

Meanwhile, miniSOG is already proving 
useful. Using a combination of microscopy 
techniques, molecular biologist Clodagh 
O’Shea and her colleagues at the Salk Insti- 
tute for Biological Studies in La Jolla, Cali- 
fornia, are studying the puzzling observation 
that small viral oncoproteins seem to hijack 


MICROSCOPY 


cellular machinery to stimulate both viral and 
pathological cellular replication. According to 
O’Shea, her group wondered: “how do small 
viral oncoproteins win?” The group imaged 
infected cells by using miniSOG to label the 
adenovirus oncoprotein E4-ORF3 (ref. 6). 
Serial block-face scanning EM gave them 
reconstructed, three-dimensional views of 
infected cells. “The scanning electron micro- 
scope slices an infected cell from top to bot- 
tom in tiny 60-nm blocks,” says O’Shea. The 
group also used electron tomography to make 
hundreds of 0.5-nm-thick computational slices 
through cells, and used specialized software 
to recreate cells from the computed slices. 
“Three-dimensional reconstructions show that 
E4-ORF3 assembles into a remarkable network 
of cables that weaves through the nucleus,” 
she says. The images showed that the weave 
physically separates 


“We have’ viral DNA replica- 
been looking tion domains from 
for the green cellular nucleoli. In 
fluorescent other words, she says, 
protein of the viral oncoprotein 
electron self-assembles into 


microscopy.” a trap for tumour 
suppressors. 

A cousin of miniSOG is also on the horizon. 
Ellisman and former postdoc Alice Ting, now 
at the Massachusetts Institute of Technology in 
Cambridge, collaborated to create an enzyme 
called APEX that has enhanced singlet oxy- 
gen generator activity’. Ellisman stresses the 
importance of genetically encoded tags for 
miniSOG and APEX. “Both are molecules we 
can introduce genetically, and both will result 
in contrast in EM,” he says. 

Researchers continue to push the limits 
of correlative microscopy. “We are currently 
developing a methodology that combines 
CLIM with super-high-resolution fluorescence 
microscopy, says Stevens. “If successful, it will 


Clodagh O’Shea and her team use a variety of techniques to investigate viral oncoproteins. 


offer a new way of studying molecular interac- 
tions.” Meanwhile, Ellisman and his colleagues 
are working on expressing two genetically 
encoded miniSOGs that have different col- 
ours. “This is what we call multicolour EM,” 
he says. “The idea is that you would be able to 
do your dynamic light microscopy, then cor- 
relate a high-resolution subvolume down to 
the molecular scale. It’s a hard project, but we 
know we'll succeed.” 

Bridging light and electron microscopy with 
genetically encodable fluorescent tags may one 
day be as routine as labelling with GFP is today. 
High-throughput methods applied to tissue 
sections may make super-resolution micros- 
copy faster and easier. When that happens, 
two — or perhaps more — microscopes will 
definitely be better than one. m 


Caitlin Smith is a freelance science writer 
based in Portland, Oregon. 
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CORRECTION 

The Technology Feature ‘Reading the second 
genomic code’ (Nature 491, 143-147; 
2012) wrongly stated that Constellation 

and Genentech are collaborating to 

develop inhibitors of BET proteins and 

EZH2 chromatin-writers. Although they are 
working together on inhibitor development, 
the targets are not those mentioned. 
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Balancing act 


Many researchers struggle to take time off from the lab. But 
scientists should try to improve their work-life balance. 


BY QUIRIN SCHIERMEIER 


sa PhD student and later a postdoc at 

the University of California, Berkeley, 

Axel Meyer almost had to force himself 

to have a life outside the laboratory. Wanting to 

experience the arts, he worked the odd evening 

as an usher at Zellerbach Hall, Berkeley’s on- 

campus performance theatre. If time allowed, 

he would also play in the university’s field- 

hockey team. But on most days, his leisure 

activity was limited to riding his motorcycle 

to campus early in the morning and back home 
late at night. 

Working long hours and at weekends was 


common in Berkeley's zoology department, 
from which Meyer received his PhD in 1988. 
As an evolutionary-biology postdoc, Meyer 
says, 80-hour working weeks were the rule 
rather than the exception. 

“You might think we were brainwashed to 
work so much, and in a positive sense I guess 
we were,’ says Meyer, now chair of zoology 
and evolutionary biology at the University 
of Konstanz in Germany. But, he adds, the 
research was so exciting and he felt so privi- 
leged to be part of his supervisor's group that it 
never crossed his mind that he might be work- 
ing too hard. 

Still, for many early-career researchers the 


joy of doing science is tarnished by heavy 
workloads and excessive demands. Personal 
interests — and maybe even a family life — 
are often sacrificed for the sake of producing 
the papers and grants that pave the way to per- 
manent jobs in academia. And yet a healthy 
work-life balance can be more fruitful than 
endless hours in the lab or in front of the lap- 
top. Scientists — and their work — can benefit 
from setting aside time for hobbies, travel or 
just about anything else that does not directly 
intersect with lab activities. 


AHARD DAY’S NIGHT 

The working habits of scientists are poorly 
monitored, but an analysis of download pat- 
terns of scientific literature across all fields sug- 
gests that many researchers work after hours 
or at weekends (X. W. Wang et al. J. Informetr. 
6, 655-660; 2012). Using a tool devised by 
Springer publishing, the researchers tracked 
when people downloaded papers from the pub- 
lisher across five weekdays and four weekend 
days in April 2012. Working late into the night 
is particularly widespread among scientists in 
the United States, whereas Chinese researchers 
work many hours on the weekend, they found 
(see ‘Literature searches around the clock’). 

“Scientists are generally under great pressure, 
and most young scientists are spending much 
more time working than they initially expected 
when they were undergraduate students,” says 
co-author Shenmeng Xu, a master’s student at 
the Dalian University of Technology in China. 

At Chinese universities, she says, even mas- 
ter’s and PhD students are expected to dedi- 
cate some 60 hours a week to their studies. 
Constantly working overtime is exhausting, 
physically and mentally, and limits students’ 
social life. “I do some swimming for exercise,” 
she says. “But other than that I can't afford any 
entertainment and I can go home no more 
often than twice a year to see my parents. I like 
what I’m doing but I do wish I could have a 
more balanced life.” 

Demands on scientists have greatly increased 
as research has become more data-intensive in 
the past few decades, says Julie Overbaugh, an 
HIV researcher at the Fred Hutchinson Cancer 
Research Center in Seattle, Washington. Gen- 
erating and sifting through large amounts of 
data takes time, and using specialized equip- 
ment — which is often shared between groups 
of researchers — can mean reserving time 
slots and thus working at odd hours. Striking a 
healthy balance between work and a personal 
life is difficult, she says, but more important > 
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> than many hard-working young scientists 
and their supervisors will admit. 

“Prioritizing the things that are most impor- 
tant for you is key, as is evaluating whether you 
are using your time wisely,” says Overbaugh. 
“If you feel you're missing out on things that 
are really important for you — family, friends 
or hobbies — something is wrong” 


TAKE A BREAK 

For Daniel Mietchen, a biophysicist and web- 
tool developer who earned his PhD in 2006 
from Saarland University in Saarbriicken, 
Germany, that important something out- 
side science is playing in a band. A singer in 
a traditional Central Asian song and dance 
group, he took pains to organize his doctoral 
research on mapping brain structure with 
nuclear magnetic resonance imaging so as to 
allow him sufficient time for rehearsals and 
gigs with his Berlin-based music collective. 
He even managed to persuade his supervisor 
to grant him four weeks of educational leave 
in 2004 to improve his Uzbek language skills 
in Samarkand. “Td strongly recommend that 
anyone complement their research work with 
at least one non-scientific activity that you 
really enjoy,’ he says. “As for me, I get my best 
ideas in unfamiliar surroundings, often while 
travelling, but rarely in the lab?” 

Fearing disapproval from colleagues and 
superiors, few early-career scientists treat 
themselves to such out-of-office time. But 
time off should be part of any sensible research 
schedule, says Sabine Lerch, an independent 
soft-skills instructor who frequently coaches 
German PhD students on time management. 

Young scientists, she adds, should rid them- 
selves of “imaginary demands” such as work- 
ing extra hours in the lab. “You will achieve 
more in one productive day than on a series of 
days in poor mental and physical condition,” 
she says. “Students tend to think about rec- 
reation last when they structure their research 
work — if they structure their time at all — but 
everybody needs breaks.” Lerch suggests that 


scientists keep at least one weekend day clear 
of any professional duties and set aside time on 
work days for exercise and hobbies. 

Even short breaks from intense work help 
recharge creativity, agrees Overbaugh. In 
science, she says, success is not necessarily a 
function of the amount of time spent in the lab 
or at the computer. Scientists are more likely 
to produce new ideas and insights when they 
are not under deadline pressure, she suggests. 

Time-management guidance from advis- 
ers can be invaluable. When HIV researcher 
Jennifer Kerubo Maroa first arrived in Seattle 
from Kenya to do a PhD at the Hutchinson 
Center, her supervisor, Overbaugh, told her 
more than once that getting herself and her 
family settled was more important than the lab. 
“At first, I didn’t quite believe her, but I eventu- 
ally accepted she was being truthful with her 
advice,’ says Kerubo 


Maroa. “When I “Students tend 
arrivedin the United [0 think about 
States I thought all recreation last 
that counts is work. — but everybody 


Knowing thatI could needs breaks.” 
leave whenever I 

needed to take care of my family’s needs in turn 
allowed me to organize a balanced schedule 
between the lab and my kids” 

“Of course there were many times after she 
got settled that she worked very hard on her 
science,’ says Overbaugh. “But she also made 
sure to keep her family life in balance.” Kerubo 
Maroa has since received the distinguished 
student award from Hutchinson, produced 
a couple of papers, and accepted an offer of 
a postdoc position at the KwaZulu-Natal 
Research Institute for Tuberculosis and HIV in 
Durban, South Africa, a collaboration between 
the US Howard Hughes Medical Institute and 
the University of KwaZulu-Natal. 

At the Hutchinson Center, students can get 
advice on work and work-life balance from 
mentoring committees that include three 
senior faculty members from across the spec- 
trum of age, ethnicity and career level. The 


LITERATURE SEARCHES AROUND THE CLOCK 


The number of research papers downloaded each hour on 12 April 2012 varied by country. 
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committees are there to guide early-career 
researchers informally, as well as to formally 
evaluate PhD students’ annual progress. 


POINT OUT PRIORITIES 

If supervisors are not understanding about 
time pressures, making a specific plan can obvi- 
ate conflicts. Lerch advises students to make a 
project plan, laying out what is to be done and 
by when, and to revisit it regularly. She suggests 
that they then make a shorter-term plan with 
more specific goals. Ifa supervisor asks for extra 
tasks, the student can point out what other aims, 
already planned, will have to fall by the wayside. 
And scientists should try not to take on too 
much, says Lerch — they should not be afraid 
to say no to taking on administrative tasks and 
other advisory roles. 

Such measures — in conjunction with the 
support of trusted colleagues, friends and 
career coaches — can help to mitigate the 
stress created by demanding supervisors. In 
extreme cases, if there are misconduct issues or 
breaches of employment law, scientists should 
seek advice from an ombudsman or PhD 
organization, such as the European Council 
of Doctoral Candidates and Junior Research- 
ers (Eurodoc) in Brussels. 

Even with scientists’ best efforts, progress is 
difficult. Many countries have been hit by the 
financial crisis, for example, forcing young sci- 
entists to fight even harder for jobs and fund- 
ing — which means more demands on their 
time. “Across hard-hit southern Europe, most 
of those who do still have work are now doing 
a double job, including full-time teaching and 
full-time research, for one salary,’ says Euro- 
doc president Slobodan Radi¢ey, a doctoral 
candidate in industrial engineering at the 
University of Novi Sad in Serbia. Few young 
researchers, he adds, have anything resembling 
a healthy work-life balance. For Radi¢ev him- 
self, this balance is a pipe dream. Although his 
studies are in Serbia, he spends much of his 
time in Italy, where his wife has a science job. 
He says that his situation only works because 
his supervisor is sympathetic — and because 
flights are cheap. But when jobs are in short 
supply, it is tempting to prioritize career obliga- 
tions over personal challenges. 

However, not everyone agrees that working 
long hours is a bad thing. Meyer tries to convey 
to his students and postdocs the inexhaustible 
thrill that he experienced during his hard-work- 
ing Berkeley years. Meyer is demanding, even 
elitist. He hopes that all his lab members will 
want to become researchers or professors, and 
sets high standards for his group. Those who 
really love what they do should not lament the 
long hours, he opines. “I cart force anyone to 
work more than they want; he says. “But it does 
hurt me when I come in to my lab ona weekend 
and find only a couple of people at work? = 


Quirin Schiermeier is Nature’s Germany 
correspondent. 


TURNING POINT 


Patrik Rorsman 


Physiologist Patrik Rorsman accepted a 
prestigious 7-year, Can$10-million (US$10- 
million) grant to serve as the Canadian 
Excellence in Research Chair in Diabetes at 
the University of Alberta in Edmonton. But 
last autumn, after seven months in the post, 
he returned to the University of Oxford, UK, 
forgoing the generous funding. 


What persuaded you to accept the Canadian 
Excellence in Research Chair? 

I'ma bit ofa gypsy as a scientist. I had made 
two other big moves before settling at Oxford. 
So when Alberta approached me with sig- 
nificant funding and plans to spend mil- 
lions expanding their facilities, I was easily 
seduced. A former postdoc and valued col- 
league of mine, Patrick McDonald, is now a 
research star there, and the prospect of work- 
ing with him again was appealing. Alberta has 
access to human pancreatic islet cells, which 
are the focus of my work. And my children 
would be finishing school in the United 
Kingdom in a few years, so my family and I 
thought it was a good time to move. 


Describe the challenges you faced at the 
University of Alberta. 

Ihad no complaints about the university or 
the faculty; our problems were more related 
to human resources. My wife, who had been 
promised a job in university administration, 
had problems negotiating holiday time — 
which was important to her, given that our 
children were to stay in the United Kingdom. 
Ultimately, she did not accept a position. And 
although my staff members were told that 
their immigration would be fast-tracked, 
several people were experiencing difficulties. 


Was it a difficult decision to return to Oxford? 
Yes, for several reasons: my salary was signifi- 
cantly higher in Canada, for example. But I 
enjoy being on the same continent as my wife, 
and she didn't want to sacrifice her career. It 
is unfortunate that she and the university 
couldn’t come to an agreement. It would 
have been a great experience and opportunity. 
And, because I am Swedish, I was looking for- 
ward to being near mountains and snow. Part 
of me regrets that it didn't work out. 


Did you create a safety net at Oxford in case 
it didn’t work out in Canada? 

Not deliberately, but a few factors left the 
door open. I had decided to take a year of 
leave from Oxford so that my family could 
stay in our UK home for the first year that I 


was in Canada. And although I had never 
intended to come back, I had submitted 
a grant application to the Wellcome Trust 
biomedical research foundation before I left, 
as a goodwill gesture to Oxford, and ended 
up getting it. 


Do you plan to continue collaborating with 
colleagues at Alberta? 

I certainly hope we will continue working 
together in future. For example, one of my 
postdocs, Matthias Braun, with whom I’ve 
published almost 30 papers, came to Alberta 
with me, and decided to stay. 


Would you encourage other nations to start a 
research-chair programme like Canada’s? 
Yes. But it takes a lot of courage, because it 
involves federal dollars and there will always 
be opposition from people who want to see 
the money used in other ways. But if you feel 
that the science needs an influx of new ideas 
and people, this is the programme you need. 
It will help to get some really good people — 
you need significant funding to draw inter- 
national talent. 


What advice would you give to people 
considering offers from foreign universities? 
It depends on who you are and your domestic 
situation, but I encourage colleagues to take 
advantage of this perk of our profession — 
the thrill of living and working in a different 
place. One of the big problems in science is 
that there is too little mobility between coun- 
tries. But it is important to make sure that 
you negotiate the most favourable terms pos- 
sible. It is such an upheaval to move from one 
country to another. = 
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SWEDEN 
Basic-research boost 


One of Sweden's largest private research 
funders has awarded the first 30 grants in 
its early-career fellowship programme. The 
Wallenberg Academy Fellows initiative of 
the Knut and Alice Wallenberg Foundation 
in Stockholm aims to “kick-start” Swedish 
science, especially basic research, says 
Goran Sandberg, the foundation’s executive 
director. “Sweden is still performing well, 
but we need young people to focus on 
science,’ he says. At least 250 fellows across 
all disciplines will be funded. Researchers 
in and outside Sweden are nominated by 
Swedish universities, which will hire them 
as faculty members if they are selected. 
Grantees receive 7.5 million kronor 
(US$1.1 million) over 5 years, with the 
potential for a 5-year renewal. Applications 
for the next round are due by 1 March. 


UNITED KINGDOM 

Visa policy warning 

A fall in the number of students entering 
the United Kingdom could cause problems 
for graduate courses in science, technology, 
engineering and maths (STEM), warns 
Universities UK (UUK) in London, which 
represents more than 130 institutions. 
According to the Office for National 
Statistics, 8.2% fewer international students 
arrived in the country in the year to March 
2012 than in the previous year. The UUK 
says that the drop may be linked to changes 
that make visas harder to get. Jo Attwooll, 

a policy adviser for the UUK, notes that 
UK graduate STEM classes have tended 

to include a lot of non-European students. 
“Immigration policy could affect the 
number of students taking these courses 
and thus their viability,” she says. 


UNITED STATES 


Graduates can go abroad 


The US National Science Foundation 
(NSF) has expanded an international 
collaboration initiative for its graduate 
fellows. Fellows may now apply to spend 
3-12 months at an institution in Norway, 
Finland, Denmark, Sweden, Japan, 

South Korea, Singapore or France, with 

a living allowance from the host nation 
and travel subsidized by the NSE “In this 
era of science with more international 
collaboration and interdisciplinary work, 
we want to encourage our graduate 
students to step into those environments,’ 
says Maria Zacharias, a spokeswoman 

for the NSE. Applications must be in by 

1 February; awardees will be told in April. 
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AN UNINTENDED FUTURE 


BY TRISTAN SCOTT 


¢C Cc here's another one! That’s five fish- 

eyes. I win.” The croaking voice 

of old Mr Duvelle mocked his 

equally seasoned friend as they looked out 

of the window of the geriatric ward. He was 

referring to a man outside whose eyes had 

regressed into an earlier form, resembling 
the ‘pin-hole’ eyes of some fish. 

Mr Higgins lifted his cane with a liver- 
spotted hand, and stabbed at a hairy fellow 
who was shuffling down the street on his 
knuckles. “Ah darn-it,” he wheezed through 
his oxygen mask, “another monkey-boy and 
I would have won. Five seconds too late?” 

“Five decades too late,’ Sullivan thought 
to himself as he overheard the two men’s 
conversation from a few feet away. He too 
looked out at the extensive array of genetic 
regressions that passed by: a woman with a 
tail, a man without an opposable thumb. 

If these kinds of aberrations were the 
worst of it, it would not have been so dis- 
astrous, but at least half of the regressions 
were lethal, and that number got worse with 
each generation. 

“Tt was a simpler time? Mr Duvelle con- 
tinued to Mr Higgins’s affirming grunts. 
“All we had was that whole climate-change 
fiasco.” 

“And look how we fixed that!” Mr Hig- 
gins chimed in. “But kids these days, they 
don't know a thing about banding together 
for a common cause. Now they have this... 
induced ...what do they call it again?” 

“Huh?” the partially deaf Mr Duvelle 
leaned closer, straining to hear his friend. 

He pointed a gaunt hand at the people 
outside and shouted: “What do they call 
this again?” 

“Malform? 

They nodded together as Mr Higgins 
mumbled, “Yes, induced malform evolu- 
tion” 

“My granddaughter, I don’t even know 
if we are related, said Pastor Kane, a man 
who had aged well, with only fine lines on 
his dark skin. “She is not of my blood!” He 
continued to preach to his small congrega- 
tion of three elderly women across the room, 
who were fanning themselves. “We thought 
we knew what we were doing! And for all 
our improving, look at us now.” 

“What was the difference again?” asked 
Mr Duvelle loudly. “That reporter, what did 
she say the difference for a new species was? 
You know, the pretty one with the nice teeth” 
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The next great threat. 


“0.004%, Mr Higgins said before he took 
another deep gasp of oxygen. 

“That’s it. 0.004%.” Mr Duvelle’s head 
bobbed on his thin neck like a vulture. 
“That’s the difference between a Neander- 
thal and us. The difference between us and 
them.” 

If only the difference was that small, Sul- 
livan silently wished to himself. It is closer 


to 1% in really bad cases. Those ones aren't 
human any more. 

He stared out of the window at a world he 
did not recognize. It had become a fad after 
companies commercialized equipment that 
could easily replace genes, especially the ones 
that nature never gave you: a year-round tan, 
blue eyes, a better metabolism. The technol- 
ogy was originally intended to cure diseases, 
and cure them it did, but eventually it filtered 
down to be used for what the popular media 
called ‘genetic tattoos: 

It had seemed harmless at first, but over 
time, much like the slow, silent accumulation 
of greenhouse gases, it built. A change to a 
gene here, a deletion there, a little bit left in 
then passed on to the next generation, then 
more changes added to that. The changes 
were cumulative and, before they knew it, the 

visible signs started 
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have been meddling,’ the pastor announced 
again. “And now — can it be fixed?” 

Sullivan wasn't sure if it could, once you 
have mixed water and jelly, it is hard to 
separate them. Now it was as if they had 
mixed hundreds of flavours together and 
had left them to set for far too long. Natu- 
ral genes had become dependent on the 
artificial ones. If they removed the one, the 
other would be affected. It was incredibly 
difficult, if not impossible, to tease apart 
the extensive networks that had become 
so intricately linked. He scooped up a bit 
of jelly from his tray and sucked it into his 
toothless mouth. 

“Time to take a sample, Doctor Sullivan,” 
said a young nurse with pseudo-gills along 
her neck. 

He lifted up his sleeve and exposed a thin 
arm riddled with tiny puncture scars. The 
nurse placed a thin, silver canister on his arm 
and with a clipping sound another lump of 
tissue was excised. She puta plaster on the 
wound, and then left to prepare the sample. 
His arms looked no different from those of 
everyone else in the ward. All of them had 
hundreds of star-shaped scars, an indication 
of as many failed experiments. 

“Lord knows we are the last of the pure 
ones,’ the pastor preached, and the three 
woman sounded ‘Amens to his sermon. “We 
are the unedited!” He cast an outstretched 
finger to the window. “And we are their sal- 
vation!” 

Sullivan had heard enough. He stood 
up slowly and reinserted his dentures. The 
stinging of his arm was a piercing reminder 
of the work he had yet to achieve, as he shuf- 
fled through the hospital. He donned his 
lab coat, which felt as heavy as the guilt that 
would not let him rest. He looked up at his 
old award — the engraved image of Alfred 
Nobel seemed purposefully turned away, as if 
shunning him. It was, however, for the same 
reason that he had received the award, that 
humanity now faced its next, great threat. 

He took a dispirited look at the sample 
taken from him, and gazed along the rows 
of tubes containing a hundred others. It was 
a vain hope, but somehow he had to figure 
out an answer: how do you backtrack up the 
evolutionary tree? m 


Tristan Scott is a PhD student at the 
University of the Witwatersrand in the 
field of therapeutic RNA interference. His 
first fiction novel, Vertical City, has been 
accepted for publication. 
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